This is an automated notification sent by LCG Savannah.
It relates to:
                task #8149, project CDS Invenio

==============================================================================
 LATEST MODIFICATIONS of task #8149:
==============================================================================

Update of task #8149 (project cdsware):

                Category:                    None => WebStyle               
                  Status:                    Done => None                   
        Percent Complete:                    100% => 50%                    
             Assigned to:                 skaplun => simko                  
             Open/Closed:                  Closed => Open                   

    _______________________________________________________

Follow-up Comment #1:

On the contrary, it is desirable to let crawlers index all I18N collection
splash pages, because  if someone searches in google.fr for, say:

  "th?ses de doctorat au CERN"

then if we would have had properly translated collection names on CDSWEB
(that we do not have still), then those users would discover our pages
only if we allow indexing of `?ln=fr'.

***

It is therefore better to solve the Google language detection issue in
another way, e.g. by providing HTML meta headers like:

  <meta http-equiv="Content-Language" content="fr">

for French pages.

This is done now. (committed 2008-11-27)

***

That said, it would be indeed useful to populate {noindex,nofollow}
properties in
our pages everywhere, e.g. in the detailed record tabs.

==============================================================================
 OVERVIEW of task #8149:
==============================================================================

URL:
  <http://savannah.cern.ch/task/?8149>

                 Summary: GoogleBot and language support
                 Project: CDS Invenio
            Submitted by: skaplun
            Submitted on: 2008-10-16 09:31
         Should Start On: 2008-10-16 00:00
   Should be Finished on: 2008-10-16 00:00
                Category: WebStyle
                Priority: 5 - Normal
                  Status: None
                 Privacy: Private
        Percent Complete: 50%
             Assigned to: simko
             Open/Closed: Open
         Discussion Lock: Any
                  Effort: 0.00

    _______________________________________________________


GoogleBot is indexing all the languages of CDS interface. This might not be
desirable since in any case the record in CDS are mainly in English.

This is due to the sitemap sporting all the language URLs and to letting
Google follow the language URLs at the bottom. We might wish to add a
'rel="nofollow"' attribute to such URLs and to instruct sitemap generator not
to generate a link per every language.

    _______________________________________________________

Follow-up Comments:


-------------------------------------------------------
Date: 2008-11-27 16:52              By: Tibor Simko <simko>
On the contrary, it is desirable to let crawlers index all I18N collection
splash pages, because  if someone searches in google.fr for, say:

  "th?ses de doctorat au CERN"

then if we would have had properly translated collection names on CDSWEB
(that we do not have still), then those users would discover our pages
only if we allow indexing of `?ln=fr'.

***

It is therefore better to solve the Google language detection issue in
another way, e.g. by providing HTML meta headers like:

  <meta http-equiv="Content-Language" content="fr">

for French pages.

This is done now. (committed 2008-11-27)

***

That said, it would be indeed useful to populate {noindex,nofollow}
properties in
our pages everywhere, e.g. in the detailed record tabs.





    _______________________________________________________

Carbon-Copy List:

CC Address                          | Comment
------------------------------------+-----------------------------
1576                                | -COM-
2195                                | -SUB-




==============================================================================

This item URL is:
  <http://savannah.cern.ch/task/?8149>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/


Reply via email to