Guava is a fresh thinking digital marketing agency specialising in SEO, PPC, and Web Development
Want to know how to boost online sales in the run-up to Christmas? Register now
Talented and trained graduates with a passion for digital are available now through our new Digital Graduate Academy
Hear how Comet, John Lewis, Manutan •increased conversion up to 75%•online sales by 10% •saw £m’s extra sales
| reply to message » | post a new message » | e-mail to a friend » |
| Subject: | Allowing search engines to spider 'hidden' content - best practice? | |||
| Author: | DaveChaffey: view profile | all posts by this author | add to favourites | |||
| Date: | 08:57:28 22 August 2007 | |||
On 16:02:04 7 August 2007 Ashley wrote:
I should know this but I'm intrigued to hear from any SEO experts out there what the latest thinking / best practice is for allowing search engines to index restricted content e.g. content that sits behind a pay-access log in, or other barrier?
We're looking at converting all our current file content (e.g. Word files, PDFs etc.), which are mostly paid-access only research and guides, into XHTML so that we can display them as HTML or allow users to convert them (e.g. to PDF) on the fly. This will also make it easier to syndicate our content, present it on other devices, "reskin" the presentation layer and so on.
But it also would allow us to make all the contents of a report (e.g. a 200 page Word file) available for indexing by a search engine. In theory this is good because there is a lot of great, niche, content in these documents which would be great for long tail SEO and attracting high-converting traffic.
But, of course, we wouldn't want the user to actually get access to the full content itself without first paying. Nor would we want all this pay-access content existing in Google's cache.
I guess, in theory, we could allow the Googlebot and chosen other spiders to index this content as HTML but not allow real humans or other agents to do so. I believe we might be able to use the robots.txt protocol to prevent caching too?
But in the case of the above we are showing Google something that we are not showing our users - and isn't this cloaking?
However, Google's Book search seems to work in just this way so Google don't appear to be averse to indexing intellectual property / content in this way but without revealing it all?
Any thoughts / pointers / experiences welcome...
Thanks
Ashley Friedlein
CEO
E-consultancy.com
Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 7 Aug 16:02
Allowing search engines to spider 'hidden' content - best practice?, stevejohnston, 7 Aug 17:00
Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 7 Aug 17:26
Re: Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 8 Aug 14:13
Allowing search engines to spider 'hidden' content - best practice?, PaulRudman, 7 Aug 17:43
RE: Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 7 Aug 17:51
RE: Allowing search engines to spider 'hidden' content - best practice?, PaulRudman, 7 Aug 18:01
Allowing search engines to spider 'hidden' content - best practice?, AdamCrawford, 7 Aug 18:54
Allowing search engines to spider 'hidden' content - best practice?, teddie, 8 Aug 07:25
RE: Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 8 Aug 09:30
Allowing search engines to spider 'hidden' content - best practice?, Loz, 8 Aug 12:29
RE: Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 8 Aug 13:13
Allowing search engines to spider 'hidden' content - best practice?, TomStuart
, 17 Aug 10:16
Allowing search engines to spider 'hidden' content - best practice?, DaveChaffey, 22 Aug 08:57
Allowing search engines to spider 'hidden' content - best practice?, JamesOwen, 22 Aug 09:50
RE: Allowing search engines to spider 'hidden' content - best practice?, Ashley
, 23 Aug 10:40
Allowing search engines to spider 'hidden' content - best practice?, DaveChaffey, 24 Aug 07:42