[pmwiki-users] Important regarding Google AdSense was Re: odd page saving behavior - googlebot ghost posts?

Crisses crisses at kinhost.org
Sat Oct 28 04:19:25 CDT 2006


Ok, I had made a markup to put a Google AdSense ad on similepedia.com

//long google block markup
Markup('Google1', 'block',
         '/\\(:Google1:\\)/e',
         Keep("<script type='text/javascript'><!--
google_ad_client = '#################';
google_ad_width = 120;
google_ad_height = 600;
google_ad_format = '120x600_as';
google_ad_type = 'text_image';
google_ad_channel ='';
//--></script>
<script type='text/javascript'
src='http://pagead2.googlesyndication.com/pagead/show_ads.js'>
</script>"));

With this I could just put (:Google1:) where I wanted the client's  
AdSense to appear.  Easy huh?  Sure worked fine...

Much later, I created a page rating plug-in script (soon to be a  
recipe) page action that had ended with:

	UpdatePage($pagename, $oldpage, $newpage);
	HandleBrowse($pagename);
	exit();


Apparently if the AdSense is called while there are $_POST vars, the  
$_POST vars get echoed back at the site when performing the  
HandleBrowse -- I had to add an unset($_POST) to stop the page rating  
echo coming from the googlebots.

	UpdatePage($pagename, $oldpage, $newpage);
	unset($_POST);
	HandleBrowse($pagename);
	exit();

The echo details, and what was happening to the wiki.d pages are  
enumerated below, and didn't happen while developing my plug-in on my  
laptop via localhost.

I have NO idea why the Gogglebot is echoing the $_POST back to the  
site when the AdSense loads on the page.... but this is the fix that  
worked for my script.  I thought perhaps inquiring developers would  
want to know....

Crisses

On Oct 26, 2006, at 12:13 AM, Crisses wrote:

> On a localhost server, my new page-rating script is just peachy.
>
> On the live server, I get some really bizzare behavior.  According to
> the Pmwiki wiki.d files, immediately after a page is rated, a
> googlebot (according to a dig -x on the IP address) that is NOT
> logged in (last author changes to blank) and thus NOT allowed to edit
> the page adds an identical rating to the page.
>
> This doesn't happen on localhost.
>
> wiki.d text -- note IP 66.249.66.2 (and it isn't always the same
> address)
>
> 2.66.249.66.in-addr.arpa. 86400 IN      PTR
> crawl-66-249-66-2.googlebot.com.
>
> ;; AUTHORITY SECTION:
> 66.249.66.in-addr.arpa. 25354   IN      NS      ns3.google.com.
> ...
>
>
>
> version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
> RatingAverage=4
> RatingCount=2
> RatingSum=8
> agent=Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/418.9
> (KHTML, like Gecko) Safari/419.3
> author=crisses
> host=69.86.108.178
> name=Simile.AsILayDying:FaulknerWilliam:HeSoundedLikeA
> rev=9
> targets=Category.Author-FaulknerWilliam,Category.Title-
> AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Ty 
> pe
> -Fiction,Category.Publisher-Vintage
> text=(:Rating_:5:)%0a(:Rating_crisses:3:)%0a(:div class=category:)[[!
> Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
> Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a
> (:divend:)
> time=1161834113
> author:1161834113=crisses
> diff:1161834113:1161834026:=2c2%0a%3c (:Rating_crisses:3:)%0a---%0a>
> (:Rating_crisses:5:)%0a
> host:1161834113=69.86.108.178
> author:1161834026=
> diff:1161834026:1161834025:=
> host:1161834026=66.249.66.2
> author:1161834025=
> diff:1161834025:1161834023:=1c1%0a%3c (:Rating_:5:)%0a---%0a>
> (:Rating_:3:)%0a
> host:1161834025=66.249.66.2
> author:1161834023=crisses
> diff:1161834023:1161833952:=2c2%0a%3c (:Rating_crisses:5:)%0a---%0a>
> (:Rating_crisses:3:)%0a
> host:1161834023=69.86.108.178
> author:1161833952=
> diff:1161833952:1161833951:=
> host:1161833952=66.249.66.2
> author:1161833951=
> diff:1161833951:1161833949:=1d0%0a%3c (:Rating_:3:)%0a
> host:1161833951=66.249.66.2
> author:1161833949=crisses
> diff:1161833949:1160766956:=1d0%0a%3c (:Rating_crisses:3:)%0a
> host:1161833949=69.86.108.178
> author:1160766956=****** Client's Login ******
> diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[!
> Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
> Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c
> (:divend:)%0a\ No newline at end of file%0a
> host:1160766956=***** Client's IP*****
>
>
> The other IP I've seen on a different page was 66.249.66.193 -- also
> a googlebot address.
>
> Now, I really don't think this is something I programmed into the
> script -- a googlebot isn't logged in and shouldn't even SEE the
> forms in question.  I dumped all the key variables at several points
> during execution, until just before:
>
> 	UpdatePage($pagename, $oldpage, $newpage);
> 	HandleBrowse($pagename);
> 	exit();
>
> At the end of my script.  I would think I'm saying "Update the page,
> then browse the page, and make sure you quit."
>
> During my execution, all the variables look good.  The next time I
> refresh the page, there's this strange echo with no author that has a
> googlebot IP address.  It's obviously running through my script
> because the RatingCount=2 means it incremented the counter, but I'm
> not getting the variable dumps I put into the script, which means the
> script is running after my browser session shuts down.
>
> This should be in no way related to the register_shutdown_functions I
> have in Main.AddSimile.php -- the pages in question aren't even in
> the Main group, and my custom scripts are triggered by specific
> action keywords and aren't supposed to even be loaded much less run
> while browsing the Simile group....
>
>
> Here's $newpage after the UpdatePage is run:
>
> Array
> (
>      [version] => pmwiki-2.2.0-beta14 ordered=1 urlencoded=1
>      [agent] => Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:
> 1.8.0.7) Gecko/20060909 Firefox/1.5.0.7
>      [author] => crisses
>      [host] => **** Client IP ******
>      [name] => Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
>      [rev] => 2
>      [targets] => Category.Author-FaulknerWilliam,Category.Title-
> AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Ty 
> pe
> -Fiction,Category.Publisher-Vintage
>      [text] => (:Rating_crisses:4:)
> (:div class=category:)[[!Author-FaulknerWilliam]] [[!Title-
> AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type-
> Fiction]] [[!Publisher-Vintage]]
> (:divend:)
>      [time] => 1160766956
>      [author:1160766956] => **** Client ID *****
>      [diff:1160766956:1160766956:] => 1,2d0
> < (:div class=category:)[[!Author-FaulknerWilliam]] [[!Title-
> AsILayDying]] [[!Year-1930]] [[!Period-20thCentury]] [[!Type-
> Fiction]] [[!Publisher-Vintage]]
> < (:divend:)
> \ No newline at end of file
>
>      [host:1160766956] => **** Client IP ******
>      [RatingCount] => 1
>      [RatingSum] => 4
>      [RatingAverage] => 4
>      [author:1161835706] => crisses
>      [host:1161835706] => 69.86.108.178
>      [diff:1161835706:1160766956:] => 1d0
> < (:Rating_crisses:4:)
>
> )
>
>
> This would be 100% correct -- if it didn't change a moment later.
>
>
> Here's the page text on the disk after the user has control of the
> browser again, without making another browser transaction:
>
> version=pmwiki-2.2.0-beta15 ordered=1 urlencoded=1
> RatingAverage=4
> RatingCount=2
> RatingSum=8
> agent=Mediapartners-Google/2.1
> author=
> host=66.249.66.2
> name=Simile.AsILayDying:FaulknerWilliam:IFeelLikeA
> rev=5
> targets=Category.Author-FaulknerWilliam,Category.Title-
> AsILayDying,Category.Year-1930,Category.Period-20thCentury,Category.Ty 
> pe
> -Fiction,Category.Publisher-Vintage
> text=(:Rating_:4:)%0a(:Rating_crisses:4:)%0a(:div class=category:)[[!
> Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
> Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a
> (:divend:)
> time=1161835709
> author:1161835709=
> diff:1161835709:1161835708:=
> host:1161835709=66.249.66.2
> author:1161835708=
> diff:1161835708:1161835706:=1d0%0a%3c (:Rating_:4:)%0a
> host:1161835708=66.249.66.2
> author:1161835706=crisses
> diff:1161835706:1160766956:=1d0%0a%3c (:Rating_crisses:4:)%0a
> host:1161835706=69.86.108.178
> author:1160766956=**** Client ID ******
> diff:1160766956:1160766956:=1,2d0%0a%3c (:div class=category:)[[!
> Author-FaulknerWilliam]] [[!Title-AsILayDying]] [[!Year-1930]] [[!
> Period-20thCentury]] [[!Type-Fiction]] [[!Publisher-Vintage]]%0a%3c
> (:divend:)%0a\ No newline at end of file%0a
> host:1160766956=*****Client IP ******
>
>
>
> Any clues?
>
> Crisses
>
> _______________________________________________
> pmwiki-users mailing list
> pmwiki-users at pmichaud.com
> http://www.pmichaud.com/mailman/listinfo/pmwiki-users
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: /pipermail/pmwiki-users/attachments/20061028/5f5f6c27/attachment.html 


More information about the pmwiki-users mailing list