Hi all, for what I've read, it suffices to generate a sitemap file with MediaWiki and how to submit it to Google. There is a script for that: generateSitemap.php. Once done, the sitemap has to be updated regularly in order to include the new pages.
If it is more complicated, I hope that in Singapore, directly speaking to people, can solve the matter. Cheers, A. *Ruthven* on Wikipedia On Tue, 1 Aug 2023 at 12:03, Sohom Datta <[email protected]> wrote: > For everyone : >> > Plus, the problem is not *better* indexing (that would be great of >> course), the problem here is that a lot (if not most) Wikisource pages are >> *not* indexed! >> If you type "when men went up in balloons" or "The banners rustle in the >> breeze" in Google (taken from the previous examples), Wikisource doesn't >> appear at all (and for other examples, when it does, it's incomplete and >> with a long delay of - at least - several months, the first example was >> done last week, the second was created in February!). >> That's why some of us suspect something is wrong somewhere (maybe because >> of the proofread extension ? and that Google doesn't "see" our pages. > > I've tested a bunch of non-indexed pages (some of which are from 2011) > across multiple Wikisources, it shows up as "URL is available to Google'' > (indicating that Googlebot can see the page). I am pretty sure that there > is no issue wrt to this specific thing on ProofreadPage/Wikisource > extension's side (there could be one on Google's end?) > > More gadget you use, more unreadable is the pages to the machines. > > > > Try to have a blind person reading the Wikisource pages with their own >> tools. Will it be difficult for them? Found the problem. > > > > You want to create pages which are more user friendly? But you do it for >> people having eyes. Machines doesn't have eyes. > > I agree with this on principle, but Wikisource pages are by default > readable without any javascript. And testing with Google bot (on the search > console) shows that it is able to read the associated content. > > I think this is a good opportunity to discuss with Google's Search Team >> here in Singapore in 2 weeks time. > > This would definitely be great :) > > Regards, > Sohom Datta > --- > Open-source contributor @Wikimedia > > > On Tue, Aug 1, 2023 at 3:18 PM Nicolas VIGNERON < > [email protected]> wrote: > >> Thanks Ilario, >> That's very good general advice but would you have more concrete and >> specific ones? >> For instance, what would you change on >> https://en.wikisource.org/wiki/Baltimore_American/Volume_192/Issue_34,925/Travel_by_Balloon >> or >> https://en.wikisource.org/wiki/The_Poetical_Works_of_William_Motherwell/The_Crusader%27s_Farewell >> ? >> >> For everyone : >> Plus, the problem is not *better* indexing (that would be great of >> course), the problem here is that a lot (if not most) Wikisource pages are >> *not* indexed! >> If you type "when men went up in balloons" or "The banners rustle in the >> breeze" in Google (taken from the previous examples), Wikisource doesn't >> appear at all (and for other examples, when it does, it's incomplete and >> with a long delay of - at least - several months, the first example was >> done last week, the second was created in February!). >> That's why some of us suspect something is wrong somewhere (maybe because >> of the proofread extension ? and that Google doesn't "see" our pages. >> >> Cheers, >> Nicolas >> >> Le mar. 1 août 2023 à 11:24, Ilario valdelli <[email protected]> a >> écrit : >> >>> To better index, you must follow as much as possible structured page. >>> >>> More gadget you use, more unreadable is the pages to the machines. >>> >>> Try to have a blind person reading the Wikisource pages with their own >>> tools. Will it be difficult for them? Found the problem. >>> >>> You want to create pages which are more user friendly? But you do it for >>> people having eyes. Machines doesn't have eyes. >>> >>> KInd regards >>> >>> On 01/08/2023 08:47, Bodhisattwa wrote: >>> > Hello all, >>> > >>> > Apologies for cross-posting. >>> > >>> > For those who have not noticed till now, Google is not indexing any >>> > Wikisource language editions for the last couple of years which >>> > practically means that any Wikisource contents in any languages, which >>> > are being created in these years, are not searchable on Google and >>> > hence largely remain invisible on the web. >>> > >>> > This is an extremely demotivating and frustrating situation for the >>> > existing Wikisource volunteers to witness, draining away all of our >>> > past and current efforts to bring and retain viewers, readers, GLAM >>> > partners and any potential new editors. We already have a very low >>> > awareness and visibility about Wikisource among general internet users >>> > due to lack of organized support in these years but the invisibility >>> > on Google search engine could become the last nail in our coffin, >>> > unless it is fixed soon. >>> > >>> > There is a phabricator ticket raised by Darwinius back in December >>> > 2022 - https://phabricator.wikimedia.org/T325607. >>> > >>> > Can't this issue be put into priority by sys admins and WMF to work >>> > upon? Wikisource is still a sister project of Wikimedia and it needs >>> > some very basic care, after all. >>> > >>> > Regards, >>> > Bodhisattwa >>> > (Bengali Wikisource volunteer) >>> > >>> > >>> > _______________________________________________ >>> > Wikisource-l mailing list -- [email protected] >>> > To unsubscribe send an email to [email protected] >>> >>> -- >>> Ilario Valdelli >>> Wikimedia CH >>> Verein zur Förderung Freien Wissens >>> Association pour l’avancement des connaissances libre >>> Associazione per il sostegno alla conoscenza libera >>> Switzerland - 8008 Zürich >>> Wikipedia: Ilario >>> Skype: valdelli >>> Tel: +41764821371 >>> http://www.wikimedia.ch >>> _______________________________________________ >>> Wikisource-l mailing list -- [email protected] >>> To unsubscribe send an email to [email protected] >>> >> _______________________________________________ >> Wikisource-l mailing list -- [email protected] >> To unsubscribe send an email to [email protected] >> > _______________________________________________ > Wikisource-l mailing list -- [email protected] > To unsubscribe send an email to [email protected] >
_______________________________________________ Wikisource-l mailing list -- [email protected] To unsubscribe send an email to [email protected]
