Sergey Chernyshev wrote: > It seems that inline warnings are being crawled and indexed by Google > which is quite bad. > > Here's home Google listing for one of my pages looks: > > *JavaScript: The Good Parts* - Technical Presentations > <http://www.techpresentations.org/JavaScript:_The_Good_Parts> > > warning.pngSorry, URIs from the range > "http://www.techpresentations.org{{#mediapath:*JavaScript* *The Good > Parts*.jpg}}" are not available in this place. *...* > www.techpresentations.org/ > <http://www.techpresentations.org/>*JavaScript*:_The_*Good*_*Parts* > - 18k - > > > I fixed the error and google is probably going to update it eventualy, > but still, it's not very good idea to have embedded HTML in there - > maybe it's better to have them inserted using JS instead... it might > help with enabling/disabling it on per-user basis as well.
You might be able to use the googleoff/on comment tags. You want to turn off snippet and index, but you can probably just turn off everything with <!--googleoff: all--> warning HTML stuff <!--googleon: all --> Details at <http://code.google.com/apis/searchappliance/documentation/46/admin_crawl/Preparing.html#pagepart> This definitely works for the Google Search Appliance, but I can't find conclusive evidence whether Google's own Web crawler respects these tags. Google tries to be smart about what to display in snippets, I'm not sure what heuristics work these days to discourage it. Try looking at Google's cached version of your page for clues. With JavaScript enabled, the SMW warnings are surrounded by <span style="display: none">, but the Google crawler sees the page with the warning in a regular <div>. <http://googlewebmastercentral.blogspot.com/2007/09/improve-snippets-with-meta-description.html> suggests you can control the snippet using a <META NAME="Description" CONTENT = "blah blah" /> tag, you might be able to change your skin to output something here. You can turn off the snippet altogether with <META NAME="GOOGLEBOT" CONTENT="NOSNIPPET"> , see <http://www.google.com/support/webmasters/bin/answer.py?answer=35304> -- =S Page ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Semediawiki-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/semediawiki-devel
