Dear Tim, In respect to the `citation_abstract_html_url` being the handle (from handle.net, thus a completely different URL) and not a subdomain of the original website:
I am concerned that this may be causing issues with the indexing process. Interestingly, I have another repository without a custom handle, and it is being indexed correctly. If you have time, could you message the Google Scholar team and maybe ask them that? I understand that implementing the necessary changes is a simple fix. However, I would like to ensure that it is indeed a problem before proceeding. Regards Santiago. On Wednesday, July 26, 2023 at 7:02:13 PM UTC+2 DSpace Community wrote: > Hi Santiago, > > I receive regular updates from the Google Scholar team on DSpace indexing. > I tend to talk with them a few times a year & receive updates about any > common issues they've found with indexing DSpace sites. They have never > mentioned that the date format in "citation_publication_date" is an issue. > If they ever do, we'd treat it as a bug and get it fixed. > > However, I can verify that the "citation_publication_date" field simply > uses the same date as your "dc.date.issued" metadata field on the Item. > So, if you modify the "dc.date.issued" that will change the value in your > "citation_publication_date" meta tag. But, I do not believe that is > necessary for Google Scholar to index your site. > > Similarly, I've not heard of any issues with "citation_abstract_html_url" > being the handle. This field takes its value from the "dc.identifier.uri" > metadata field on your Item. So, if the Item has a different value in that > field, it will be used in the "citation_abstract_html_url". > > Overall, it is possible to modify the behavior of these Google Scholar > tags in DSpace 7. But, you have to modify the behavior of the > corresponding "setCitation*Tag()" method in the "metadata.service.ts" file > in the UI. You'd then need to recompile and restart the UI. For instance, > here's the method that sets the "citation_abstract_html_url" tag value: > https://github.com/DSpace/dspace-angular/blob/main/src/app/core/metadata/metadata.service.ts#L286 > > Tim > > On Wednesday, July 26, 2023 at 2:02:35 AM UTC-5 [email protected] > wrote: > >> Also, `citation_date` is not formatted as required by Google. This is a >> problem? >> >> I don't know if we have to follow the format "obligatorily": >> >> Provide full dates in the "2010/5/12" format if available; or a year >> alone otherwise. >> >> And the last question: it is okay for the citation_abstract_html_url to >> be a handle URL (handle.net), isn't it? >> >> I really don't know why we are not indexed by Google. >> >> Sorry to bother you with all these questions. >> >> Regards, >> Santiago. >> >> On Wednesday, July 26, 2023 at 8:51:57 AM UTC+2 Santiago Lo Coco wrote: >> >>> I fixed the problem by adding this line: >>> >>> proxy_set_header Host $host; >>> >>> Regards, >>> Santiago. >>> >>> On Wednesday, July 26, 2023 at 8:34:30 AM UTC+2 Santiago Lo Coco wrote: >>> >>>> Thank you Tim. >>>> >>>> The problem is that I already done that. >>>> >>>> This is my nginx config for the frontend: >>>> >>>> location / { >>>> proxy_set_header X-Forwarded-Proto https; >>>> proxy_set_header X-Forwarded-Host $host; >>>> proxy_set_header X-Forwarded-Server $host; >>>> proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for; >>>> proxy_pass http://localhost:4000; >>>> } >>>> >>>> Do you know if there is a mistake? >>>> >>>> I also add some `add_header` directives for debugging and they are >>>> working perfectly. >>>> >>>> This is the output of `curl -v`: >>>> >>>> X-Forwarded-Host: repositorio.uflo.edu.ar >>>> X-Forwarded-Proto: https >>>> >>>> Regards, >>>> Santiago. >>>> On Tuesday, July 25, 2023 at 10:35:03 PM UTC+2 DSpace Community wrote: >>>> >>>>> Hi Santiago, >>>>> >>>>> This sounds like this issue which is documented on our SEO guide in >>>>> the official docs: >>>>> https://wiki.lyrasis.org/display/DSDOC7x/Search+Engine+Optimization#SearchEngineOptimization-EnsureyourproxyispassingX-ForwardedheaderstotheUserInterface >>>>> >>>>> I suspect you need to add both X-Forwarded-Proto and X-Forwarded-Host >>>>> headers to your proxy. >>>>> >>>>> Tim >>>>> >>>>> On Tuesday, July 25, 2023 at 2:52:12 PM UTC-5 [email protected] >>>>> wrote: >>>>> >>>>>> Hello, >>>>>> >>>>>> I am having a problem. For some reason, google scholar isn't indexing >>>>>> my website. The website generates almost all the *meta *tags >>>>>> perfectly. For example, with this article >>>>>> <https://repositorio.uflo.edu.ar/entities/art%C3%ADculo/d5c80da7-b559-48fb-bf2b-c532671900a3> >>>>>> : >>>>>> >>>>>> <meta name="citation_title" content="Revisión sistemática sobre >>>>>> psicoterapias efectivas y/o tratamientos combinados con pacientes con >>>>>> severidad y comorbilidad"> >>>>>> <meta name="citation_author" content="Scherb, Elena Diana"> >>>>>> <meta name="citation_publication_date" content="2022-05"> >>>>>> <meta name="citation_issn" content="2602-8379"> >>>>>> <meta name="citation_language" content="es"> >>>>>> <meta name="citation_keywords" content="PSICOTERAPIA; >>>>>> PSICOPATOLOGIA"> >>>>>> <meta name="citation_abstract_html_url" content=" >>>>>> https://hdl.handle.net/20.500.14340/909"> >>>>>> <meta name="citation_publisher" content="Universidad Estatal de >>>>>> Milagro"> >>>>>> <meta name="citation_pdf_url" content=" >>>>>> https://localhost:4000/bitstreams/ea3094d6-1d9f-4c42-88aa-7bb038833f79/download >>>>>> "> >>>>>> >>>>>> The only wrong tag and that is maybe the one that is causing the >>>>>> index to fail is the last one. How can I fix it? Because I can hardcode >>>>>> it >>>>>> in the frontend file but there must be a better solution. I am using a >>>>>> nginx proxy with the X-.. tags correctly. >>>>>> >>>>>> Do you know if this is the problem causing google scholar to not >>>>>> index our website? >>>>>> >>>>>> Regards, >>>>>> Santiago. >>>>>> >>>>>> -- All messages to this mailing list should adhere to the Code of Conduct: https://www.lyrasis.org/about/Pages/Code-of-Conduct.aspx --- You received this message because you are subscribed to the Google Groups "DSpace Community" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/dspace-community/b7d7415c-44fc-4ca1-8c23-481dbc8c7472n%40googlegroups.com.
