[pmwiki-users] odd page saving behavior - googlebot ghost posts?

Crisses crisses at kinhost.org
Wed Oct 25 23:13:56 CDT 2006


On a localhost server, my new page-rating script is just peachy.

On the live server, I get some really bizzare behavior.  According to  
the Pmwiki wiki.d files, immediately after a page is rated, a  
googlebot (according to a dig -x on the IP address) that is NOT  
logged in (last author changes to blank) and thus NOT allowed to edit  
the page adds an identical rating to the page.

This doesn't happen on localhost.

wiki.d text -- note IP 66.249.66.2 (and it isn't always the same  
address)

2.66.249.66.in-addr.arpa. 86400 IN      PTR      
crawl-66-249-66-2.googlebot.com.

;; AUTHORITY SECTION:
66.249.66.in-addr.arpa. 25354   IN      NS      ns3.google.com.
...



version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
RatingAverage=4
RatingCount=2
RatingSum=8
agent=Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.9  
(KHTML, like Gecko) Safari/419.3
author=crisses
host=69.86.108.178
name=Simile.AsILayDying:FaulknerWilliam:HeSoundedLikeA
rev=9
targets=Category.Author-FaulknerWilliam,Category.Title- 
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type 
-Fiction,Category.Publisher-Vintage
text=(:Rating_:5:)%0a(:Rating_crisses:3:)%0a(:div class=category:)[[! 
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[! 
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a 
(:divend:)
time=1161834113
author:1161834113=crisses
diff:1161834113:1161834026:=2c2%0a%3c (:Rating_crisses:3:)%0a---%0a>  
(:Rating_crisses:5:)%0a
host:1161834113=69.86.108.178
author:1161834026=
diff:1161834026:1161834025:=
host:1161834026=66.249.66.2
author:1161834025=
diff:1161834025:1161834023:=1c1%0a%3c (:Rating_:5:)%0a---%0a>  
(:Rating_:3:)%0a
host:1161834025=66.249.66.2
author:1161834023=crisses
diff:1161834023:1161833952:=2c2%0a%3c (:Rating_crisses:5:)%0a---%0a>  
(:Rating_crisses:3:)%0a
host:1161834023=69.86.108.178
author:1161833952=
diff:1161833952:1161833951:=
host:1161833952=66.249.66.2
author:1161833951=
diff:1161833951:1161833949:=1d0%0a%3c (:Rating_:3:)%0a
host:1161833951=66.249.66.2
author:1161833949=crisses
diff:1161833949:1160766956:=1d0%0a%3c (:Rating_crisses:3:)%0a
host:1161833949=69.86.108.178
author:1160766956=****** Client's Login ******
diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[! 
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[! 
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c  
(:divend:)%0a\ No newline at end of file%0a
host:1160766956=***** Client's IP*****


The other IP I've seen on a different page was 66.249.66.193 -- also  
a googlebot address.

Now, I really don't think this is something I programmed into the  
script -- a googlebot isn't logged in and shouldn't even SEE the  
forms in question.  I dumped all the key variables at several points  
during execution, until just before:

	UpdatePage($pagename, $oldpage, $newpage);
	HandleBrowse($pagename);
	exit();

At the end of my script.  I would think I'm saying "Update the page,  
then browse the page, and make sure you quit."

During my execution, all the variables look good.  The next time I  
refresh the page, there's this strange echo with no author that has a  
googlebot IP address.  It's obviously running through my script  
because the RatingCount=2 means it incremented the counter, but I'm  
not getting the variable dumps I put into the script, which means the  
script is running after my browser session shuts down.

This should be in no way related to the register_shutdown_functions I  
have in Main.AddSimile.php -- the pages in question aren't even in  
the Main group, and my custom scripts are triggered by specific  
action keywords and aren't supposed to even be loaded much less run  
while browsing the Simile group....


Here's $newpage after the UpdatePage is run:

Array
(
     [version] => pmwiki-2.2.0-beta14 ordered=1 urlencoded=1
     [agent] => Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv: 
1.8.0.7) Gecko/20060909 Firefox/1.5.0.7
     [author] => crisses
     [host] => **** Client IP ******
     [name] => Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
     [rev] => 2
     [targets] => Category.Author-FaulknerWilliam,Category.Title- 
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type 
-Fiction,Category.Publisher-Vintage
     [text] => (:Rating_crisses:4:)
(:div class=category:)[[!Author-FaulknerWilliam]] [[!Title- 
AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type- 
Fiction]] [[!Publisher-Vintage]]
(:divend:)
     [time] => 1160766956
     [author:1160766956] => **** Client ID *****
     [diff:1160766956:1160766956:] => 1,2d0
< (:div class=category:)[[!Author-FaulknerWilliam]] [[!Title- 
AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type- 
Fiction]] [[!Publisher-Vintage]]
< (:divend:)
\ No newline at end of file

     [host:1160766956] => **** Client IP ******
     [RatingCount] => 1
     [RatingSum] => 4
     [RatingAverage] => 4
     [author:1161835706] => crisses
     [host:1161835706] => 69.86.108.178
     [diff:1161835706:1160766956:] => 1d0
< (:Rating_crisses:4:)

)


This would be 100% correct -- if it didn't change a moment later.


Here's the page text on the disk after the user has control of the  
browser again, without making another browser transaction:

version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
RatingAverage=4
RatingCount=2
RatingSum=8
agent=Mediapartners-Google/2.1
author=
host=66.249.66.2
name=Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
rev=5
targets=Category.Author-FaulknerWilliam,Category.Title- 
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type 
-Fiction,Category.Publisher-Vintage
text=(:Rating_:4:)%0a(:Rating_crisses:4:)%0a(:div class=category:)[[! 
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[! 
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a 
(:divend:)
time=1161835709
author:1161835709=
diff:1161835709:1161835708:=
host:1161835709=66.249.66.2
author:1161835708=
diff:1161835708:1161835706:=1d0%0a%3c (:Rating_:4:)%0a
host:1161835708=66.249.66.2
author:1161835706=crisses
diff:1161835706:1160766956:=1d0%0a%3c (:Rating_crisses:4:)%0a
host:1161835706=69.86.108.178
author:1160766956=**** Client ID ******
diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[! 
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[! 
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c  
(:divend:)%0a\ No newline at end of file%0a
host:1160766956=*****Client IP ******



Any clues?

Crisses




More information about the pmwiki-users mailing list