[pmwiki-users] odd page saving behavior - googlebot ghost posts?
Crisses
crisses at kinhost.org
Wed Oct 25 23:13:56 CDT 2006
On a localhost server, my new page-rating script is just peachy.
On the live server, I get some really bizzare behavior. According to
the Pmwiki wiki.d files, immediately after a page is rated, a
googlebot (according to a dig -x on the IP address) that is NOT
logged in (last author changes to blank) and thus NOT allowed to edit
the page adds an identical rating to the page.
This doesn't happen on localhost.
wiki.d text -- note IP 66.249.66.2 (and it isn't always the same
address)
2.66.249.66.in-addr.arpa. 86400 IN PTR
crawl-66-249-66-2.googlebot.com.
;; AUTHORITY SECTION:
66.249.66.in-addr.arpa. 25354 IN NS ns3.google.com.
...
version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
RatingAverage=4
RatingCount=2
RatingSum=8
agent=Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.9
(KHTML, like Gecko) Safari/419.3
author=crisses
host=69.86.108.178
name=Simile.AsILayDying:FaulknerWilliam:HeSoundedLikeA
rev=9
targets=Category.Author-FaulknerWilliam,Category.Title-
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type
-Fiction,Category.Publisher-Vintage
text=(:Rating_:5:)%0a(:Rating_crisses:3:)%0a(:div class=category:)[[!
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a
(:divend:)
time=1161834113
author:1161834113=crisses
diff:1161834113:1161834026:=2c2%0a%3c (:Rating_crisses:3:)%0a---%0a>
(:Rating_crisses:5:)%0a
host:1161834113=69.86.108.178
author:1161834026=
diff:1161834026:1161834025:=
host:1161834026=66.249.66.2
author:1161834025=
diff:1161834025:1161834023:=1c1%0a%3c (:Rating_:5:)%0a---%0a>
(:Rating_:3:)%0a
host:1161834025=66.249.66.2
author:1161834023=crisses
diff:1161834023:1161833952:=2c2%0a%3c (:Rating_crisses:5:)%0a---%0a>
(:Rating_crisses:3:)%0a
host:1161834023=69.86.108.178
author:1161833952=
diff:1161833952:1161833951:=
host:1161833952=66.249.66.2
author:1161833951=
diff:1161833951:1161833949:=1d0%0a%3c (:Rating_:3:)%0a
host:1161833951=66.249.66.2
author:1161833949=crisses
diff:1161833949:1160766956:=1d0%0a%3c (:Rating_crisses:3:)%0a
host:1161833949=69.86.108.178
author:1160766956=****** Client's Login ******
diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[!
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c
(:divend:)%0a\ No newline at end of file%0a
host:1160766956=***** Client's IP*****
The other IP I've seen on a different page was 66.249.66.193 -- also
a googlebot address.
Now, I really don't think this is something I programmed into the
script -- a googlebot isn't logged in and shouldn't even SEE the
forms in question. I dumped all the key variables at several points
during execution, until just before:
UpdatePage($pagename, $oldpage, $newpage);
HandleBrowse($pagename);
exit();
At the end of my script. I would think I'm saying "Update the page,
then browse the page, and make sure you quit."
During my execution, all the variables look good. The next time I
refresh the page, there's this strange echo with no author that has a
googlebot IP address. It's obviously running through my script
because the RatingCount=2 means it incremented the counter, but I'm
not getting the variable dumps I put into the script, which means the
script is running after my browser session shuts down.
This should be in no way related to the register_shutdown_functions I
have in Main.AddSimile.php -- the pages in question aren't even in
the Main group, and my custom scripts are triggered by specific
action keywords and aren't supposed to even be loaded much less run
while browsing the Simile group....
Here's $newpage after the UpdatePage is run:
Array
(
[version] => pmwiki-2.2.0-beta14 ordered=1 urlencoded=1
[agent] => Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:
1.8.0.7) Gecko/20060909 Firefox/1.5.0.7
[author] => crisses
[host] => **** Client IP ******
[name] => Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
[rev] => 2
[targets] => Category.Author-FaulknerWilliam,Category.Title-
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type
-Fiction,Category.Publisher-Vintage
[text] => (:Rating_crisses:4:)
(:div class=category:)[[!Author-FaulknerWilliam]] [[!Title-
AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type-
Fiction]] [[!Publisher-Vintage]]
(:divend:)
[time] => 1160766956
[author:1160766956] => **** Client ID *****
[diff:1160766956:1160766956:] => 1,2d0
< (:div class=category:)[[!Author-FaulknerWilliam]] [[!Title-
AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type-
Fiction]] [[!Publisher-Vintage]]
< (:divend:)
\ No newline at end of file
[host:1160766956] => **** Client IP ******
[RatingCount] => 1
[RatingSum] => 4
[RatingAverage] => 4
[author:1161835706] => crisses
[host:1161835706] => 69.86.108.178
[diff:1161835706:1160766956:] => 1d0
< (:Rating_crisses:4:)
)
This would be 100% correct -- if it didn't change a moment later.
Here's the page text on the disk after the user has control of the
browser again, without making another browser transaction:
version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
RatingAverage=4
RatingCount=2
RatingSum=8
agent=Mediapartners-Google/2.1
author=
host=66.249.66.2
name=Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
rev=5
targets=Category.Author-FaulknerWilliam,Category.Title-
AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Type
-Fiction,Category.Publisher-Vintage
text=(:Rating_:4:)%0a(:Rating_crisses:4:)%0a(:div class=category:)[[!
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a
(:divend:)
time=1161835709
author:1161835709=
diff:1161835709:1161835708:=
host:1161835709=66.249.66.2
author:1161835708=
diff:1161835708:1161835706:=1d0%0a%3c (:Rating_:4:)%0a
host:1161835708=66.249.66.2
author:1161835706=crisses
diff:1161835706:1160766956:=1d0%0a%3c (:Rating_crisses:4:)%0a
host:1161835706=69.86.108.178
author:1160766956=**** Client ID ******
diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[!
Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c
(:divend:)%0a\ No newline at end of file%0a
host:1160766956=*****Client IP ******
Any clues?
Crisses
More information about the pmwiki-users
mailing list