Yes, this is definitely along the lines of what we'd ideally like to do.
The relevant Google News page (http://www.google.com/support/newspub/bin/answer.py?answer=40543) strongly suggests that they actually don't mind you selectively allowing the Googlebot through a pay wall, which from a publisher's perspective provides the best of both worlds (we get our content indexed while it remains fully protected), but clearly that's not going to create a spectacular user experience for anyone clicking through from a search result.
However, I'm concerned about the technical logistics of "First Click Free" -- namely how trivial it is to circumvent it. Anyone with a rudimentary grasp of IT can install a browser extension to spoof their HTTP referer, or disable cookies, or even (with a bit more effort) make consecutive HTTP requests appear to come from different IP addresses, and faced with that combination of techniques it becomes impossible to distinguish between someone who's genuinely just landed from a search result and someone who's done 30 seconds of browser configuration and is now trawling the site downloading all the content.
There's always the old argument that publishers can do no better than provide a mild deterrent, and that anyone who's determined enough to crack the system is essentially welcome to invest the time and effort necessary to do so, but here we're talking about a really small effort versus a really big payoff. It's one thing for, say, the Washington Post to allow First Click Free on the grounds that their content is essentially ephemeral and largely advertising-funded in the first place, but E-consultancy's content has a high and lasting value that we can't afford to compromise so readily.
Does anyone have any technical insights about how to safely release useful chunks of content to users arriving from search engine results, without also making it very easy for any half-competent user to subvert this system to his advantage? The fundamentally anonymous and stateless nature of the web makes this a very difficult problem to solve reliably. We are extremely keen to provide the richest possible user experience, but preferably not at the expense of our content assets!
Allowing search engines to spider 'hidden' content - best practice?, Ashley , 7 Aug 16:02 I should know this but I'm intrigued to hear from any SEO experts out there what the latest thinking / best practice is for allowing search engines to index restricted content e.g. ...
Allowing search engines to spider 'hidden' content - best practice?, stevejohnston, 7 Aug 17:00 The principles here are very simple, and in fact you do know it really, but you just wish it weren't so:
Any time you serve content at a specific URL to the Googlebot that is di ...
Allowing search engines to spider 'hidden' content - best practice?, Ashley , 7 Aug 17:26 I can see how this is 'cloaking' in as much as we are showing Google something the user can't access but we are not trying to deceive anyone - the content is there, we're not redir ...
Allowing search engines to spider 'hidden' content - best practice?, PaulRudman, 7 Aug 17:43 Hi Ashley,
This is something we've raised with several clients, as one of them had 300,000+ pages hidden behind a login screen, all potentially very valuable data in not only in ...
Allowing search engines to spider 'hidden' content - best practice?, AdamCrawford, 7 Aug 18:54 Hi Ashley
Google's "First Click Free" system for subscription based content on Google News can also be used with the main natural search index I believe. The following links ma ...
Allowing search engines to spider 'hidden' content - best practice?, teddie, 8 Aug 07:25 I'll side with Adam on this.
The decision is not how to completely hide your content, but how difficult you want to make it for users to get it for free that paying/registering ...
Allowing search engines to spider 'hidden' content - best practice?, Loz, 8 Aug 12:29 Are you suggesting having all 32 pages of the Online Lead Generation Report, for example, accessible to search engines but limit the number pages a user can access before the login ...
Allowing search engines to spider 'hidden' content - best practice?, TomStuart , 17 Aug 10:16 Yes, this is definitely along the lines of what we'd ideally like to do.
The relevant Google News page (http://www.google.com/support/newspub/bin/answer.py?answer=40543) strongl ...
Allowing search engines to spider 'hidden' content - best practice?, DaveChaffey, 22 Aug 08:57 I've come late to this party - holidays! and can't add much from an SEO POV, but I think there are some other alternatives to consider.
What’s not really covered is the impact o ...
Allowing search engines to spider 'hidden' content - best practice?, JamesOwen, 22 Aug 09:50 Quote: So you could have a mechanism of restricting access at the chapter/topic level – first 50 words maybe, but make all the detailed content sub-topic available, and so exploit ...
Allowing search engines to spider 'hidden' content - best practice?, DaveChaffey, 24 Aug 07:42 I wasn't exactly thinking directory, but yes it would make sense to have a multi-level site map related for this deep content for easier nav and internal linking purposes - good id ...