<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Google Proxy Hacking: How A Third Party Can Remove Your Site From Google SERPs</title>
	<atom:link href="http://www.seofaststart.com/blog/google-proxy-hacking/feed" rel="self" type="application/rss+xml" />
	<link>http://www.seofaststart.com/blog/google-proxy-hacking</link>
	<description>Frustrated by the often-confusing and conflicting SEO advice you find on the web? The best SEO book is free - stop searching and download it today!</description>
	<lastBuildDate>Fri, 02 Jul 2010 03:56:55 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: Hawaiian Shirts</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-23902</link>
		<dc:creator>Hawaiian Shirts</dc:creator>
		<pubDate>Thu, 11 Sep 2008 23:02:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-23902</guid>
		<description>&lt;p&gt;I have been hearing about this kind of stuff for quite some time now.  Is it still going on or has Google fixed the problem by now?&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>I have been hearing about this kind of stuff for quite some time now.  Is it still going on or has Google fixed the problem by now?</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Graeme Pietersz</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-22161</link>
		<dc:creator>Graeme Pietersz</dc:creator>
		<pubDate>Thu, 21 Aug 2008 05:27:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-22161</guid>
		<description>&lt;p&gt;Some pages of my site were briefly replaced by pages from a proxy.&lt;/p&gt;

&lt;p&gt;I complained to Google (with an online spam form, not a DMCA letter or any formal avenue) and they have been removed from the index.  I also emailed their hosting company and the site is now down.&lt;/p&gt;

&lt;p&gt;They were definitely doing it deliberately. Their URL resembled that of a legitimate site, and their front page redirected to proxy that site.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Some pages of my site were briefly replaced by pages from a proxy.</p>

<p>I complained to Google (with an online spam form, not a DMCA letter or any formal avenue) and they have been removed from the index.  I also emailed their hosting company and the site is now down.</p>

<p>They were definitely doing it deliberately. Their URL resembled that of a legitimate site, and their front page redirected to proxy that site.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Wick-edly Sent Scented Candles</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19833</link>
		<dc:creator>Wick-edly Sent Scented Candles</dc:creator>
		<pubDate>Wed, 21 May 2008 20:50:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19833</guid>
		<description>&lt;p&gt;This is very distressing, I often wondered if this applies to articles when we distribute them.  I&#039;ve placed articles on my site and send them through Ezine or Isnare, would that be considered duplicate content?  And does just the homepage get affected, or the entire site?&lt;/p&gt;

&lt;p&gt;Thanks for sharing this info.
WS&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>This is very distressing, I often wondered if this applies to articles when we distribute them.  I&#039;ve placed articles on my site and send them through Ezine or Isnare, would that be considered duplicate content?  And does just the homepage get affected, or the entire site?</p>

<p>Thanks for sharing this info.
WS</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Hami</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19346</link>
		<dc:creator>Hami</dc:creator>
		<pubDate>Sun, 11 May 2008 16:42:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19346</guid>
		<description>&lt;p&gt;wow! great article... my site was in google for 7 months.. but suddnely disappeared from last two weeks....&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>wow! great article&#8230; my site was in google for 7 months.. but suddnely disappeared from last two weeks&#8230;.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Thies</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19183</link>
		<dc:creator>Dan Thies</dc:creator>
		<pubDate>Thu, 08 May 2008 16:41:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19183</guid>
		<description>&lt;p&gt;Igor, buddy... it&#039;s been fun but I&#039;m gonna have to shut you down now.&lt;/p&gt;

&lt;p&gt;There&#039;s no difference between a spoofed header from curl and any other spoofed header.&lt;/p&gt;

&lt;p&gt;Even if it were possible to fake both ends of the forward and reverse lookup, it&#039;s hardly necessary, if your only goal is to construct a proxy that will fetch a page and return it to a search engine&#039;s bot.&lt;/p&gt;

&lt;p&gt;All you have to do is look like a normal user, strip out robot meta tags and X-Robots headers, and pass the result back.&lt;/p&gt;

&lt;p&gt;If this problem reappears (Google seems to have solved it for now), and a bunch of idiots start doing that, we have a number of additional countermeasures which would thwart the attempt.&lt;/p&gt;

&lt;p&gt;I will not discuss the nature of those countermeasures unless it becomes necessary to engage the issue again.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Igor, buddy&#8230; it&#039;s been fun but I&#039;m gonna have to shut you down now.</p>

<p>There&#039;s no difference between a spoofed header from curl and any other spoofed header.</p>

<p>Even if it were possible to fake both ends of the forward and reverse lookup, it&#039;s hardly necessary, if your only goal is to construct a proxy that will fetch a page and return it to a search engine&#039;s bot.</p>

<p>All you have to do is look like a normal user, strip out robot meta tags and X-Robots headers, and pass the result back.</p>

<p>If this problem reappears (Google seems to have solved it for now), and a bunch of idiots start doing that, we have a number of additional countermeasures which would thwart the attempt.</p>

<p>I will not discuss the nature of those countermeasures unless it becomes necessary to engage the issue again.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: John Hill</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19181</link>
		<dc:creator>John Hill</dc:creator>
		<pubDate>Thu, 08 May 2008 16:23:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19181</guid>
		<description>&lt;p&gt;Webmaster I am not, but I am fascinated by your article.  This morning my website which has been up for years, disappeared from Google&#039;s index.  For months it was #1 when I searched for many variations of juice plus john or juice plus canada.  I have spent the day trying to find out where it went and how to get it back.  Your article opened a whole new possibility.  I&#039;d appreciate any tips.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Webmaster I am not, but I am fascinated by your article.  This morning my website which has been up for years, disappeared from Google&#039;s index.  For months it was #1 when I searched for many variations of juice plus john or juice plus canada.  I have spent the day trying to find out where it went and how to get it back.  Your article opened a whole new possibility.  I&#039;d appreciate any tips.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Igor The Troll</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19180</link>
		<dc:creator>Igor The Troll</dc:creator>
		<pubDate>Thu, 08 May 2008 16:07:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19180</guid>
		<description>&lt;p&gt;One way, if it is possible it would be to check on each one of your pages, if the request is from culr_init() If it is cross reference the IP to your allowed bots IPs if it is on the list allow it.&lt;/p&gt;

&lt;p&gt;This would at least protect you from the home made proxies.&lt;/p&gt;

&lt;p&gt;But I did a search for checking if a request is curl_init and found nothing.&lt;/p&gt;

&lt;p&gt;As far as someone pretending to be an IP that they are not is very easy to do! Piggyback on another IP! Have that IP make the request. You can also use IP spoofing to make the handshake.&lt;/p&gt;

&lt;p&gt;Read this.
http://www.computing.net/answers/programming/php-remoteaddr-integrity/9140.html&lt;/p&gt;

&lt;p&gt;Tell us if you find a way to check for curl_init() request.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>One way, if it is possible it would be to check on each one of your pages, if the request is from culr_init() If it is cross reference the IP to your allowed bots IPs if it is on the list allow it.</p>

<p>This would at least protect you from the home made proxies.</p>

<p>But I did a search for checking if a request is curl_init and found nothing.</p>

<p>As far as someone pretending to be an IP that they are not is very easy to do! Piggyback on another IP! Have that IP make the request. You can also use IP spoofing to make the handshake.</p>

<p>Read this.
<a href="http://www.computing.net/answers/programming/php-remoteaddr-integrity/9140.html" rel="nofollow">http://www.computing.net/answers/programming/php-remoteaddr-integrity/9140.html</a></p>

<p>Tell us if you find a way to check for curl_init() request.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Thies</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19173</link>
		<dc:creator>Dan Thies</dc:creator>
		<pubDate>Thu, 08 May 2008 14:09:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19173</guid>
		<description>&lt;p&gt;Drak,&lt;/p&gt;

&lt;p&gt;What you describe is a pretty standard step in preventing content theft - this particular issue is a bit more subtle.&lt;/p&gt;

&lt;p&gt;That&#039;s a nice way to identify bad bots if you disallow /trap in robots.txt - since the bad guys rarely bother to read that file, and if they do it&#039;s usually the first place they look. :D&lt;/p&gt;

&lt;p&gt;But it wouldn&#039;t do much good when you&#039;re trying to identify good bots coming in by proxy, unless you plan to allow them to spider past a hidden link.&lt;/p&gt;

&lt;p&gt;It also would not prevent them from grabbing other content (like your home page) at a proxy URL while you&#039;re waiting for them to request the trap page.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Drak,</p>

<p>What you describe is a pretty standard step in preventing content theft &#8211; this particular issue is a bit more subtle.</p>

<p>That&#039;s a nice way to identify bad bots if you disallow /trap in robots.txt &#8211; since the bad guys rarely bother to read that file, and if they do it&#039;s usually the first place they look. <img src='http://www.seofaststart.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> </p>

<p>But it wouldn&#039;t do much good when you&#039;re trying to identify good bots coming in by proxy, unless you plan to allow them to spider past a hidden link.</p>

<p>It also would not prevent them from grabbing other content (like your home page) at a proxy URL while you&#039;re waiting for them to request the trap page.</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Drak</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19152</link>
		<dc:creator>Drak</dc:creator>
		<pubDate>Thu, 08 May 2008 09:11:03 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19152</guid>
		<description>&lt;p&gt;There is a much easier way to detect bots using a spider trap.  You simply hide a link somewhere on your site that would be invisible to normal users /trap/trap.php&lt;/p&gt;

&lt;p&gt;Then this script takes note of the IP address and adds it to your .htaccess script blocking the IP after validating if it is a bot you want to deal with (using ARIN lookups).  You can have other scripts that expire the blocks or count the number of return offenses.&lt;/p&gt;

&lt;p&gt;Drak&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>There is a much easier way to detect bots using a spider trap.  You simply hide a link somewhere on your site that would be invisible to normal users /trap/trap.php</p>

<p>Then this script takes note of the IP address and adds it to your .htaccess script blocking the IP after validating if it is a bot you want to deal with (using ARIN lookups).  You can have other scripts that expire the blocks or count the number of return offenses.</p>

<p>Drak</p>]]></content:encoded>
	</item>
	<item>
		<title>By: Igor The Troll</title>
		<link>http://www.seofaststart.com/blog/google-proxy-hacking/comment-page-6#comment-19102</link>
		<dc:creator>Igor The Troll</dc:creator>
		<pubDate>Wed, 07 May 2008 17:18:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.seofaststart.com/blog/google-proxy-hacking#comment-19102</guid>
		<description>&lt;p&gt;&quot;If my server gets a request that appears to come from 11.22.33.44, my server will send the response there. If that&#039;s not your actual IP, you won&#039;t get the response, will you?&quot;&lt;/p&gt;

&lt;p&gt;Very good and logical question. Not sure about this. I would imagine not, but who knows! Is the response based on the TCP/IP handshake or on browser sessions?&lt;/p&gt;

&lt;p&gt;Need to consult Dark Vader on this and will get back to you as more information gets illuminated from the Dark Side.&lt;/p&gt;

&lt;p&gt;May the force be with us to defeat Evil Google!&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>&#034;If my server gets a request that appears to come from 11.22.33.44, my server will send the response there. If that&#039;s not your actual IP, you won&#039;t get the response, will you?&#034;</p>

<p>Very good and logical question. Not sure about this. I would imagine not, but who knows! Is the response based on the TCP/IP handshake or on browser sessions?</p>

<p>Need to consult Dark Vader on this and will get back to you as more information gets illuminated from the Dark Side.</p>

<p>May the force be with us to defeat Evil Google!</p>]]></content:encoded>
	</item>
</channel>
</rss>
