I do find all this AI stuff to be very interesting indeed, although as has been said, I can't see it replacing the screen reader entirely; it may work better as a supplemental tool, much like the way BeMyAI complements the BeMyEyes volunteer service. As it stands now, I can ask several open source AI models to describe a picture taken with my phone's camera and get a halfway decent response back in about the same time it takes to upload the same picture to BeMyEyes and wait for its AI to respond.

The main problem with AI replacing the screen reader, though, is not speed but hallucination, which is still a huge problem with every model I've ever used for any purpose. BeMyAI once described one of the 2000 gold dollar coins as a giant penny, complete with Lincoln's face and all. I knew it was hallucinating only because I knew exactly what coin I was holding. Many times when the AI hallucinates, we don't know that's what is happening.


And the bigger problem is that I don't want my computer to try to think for me. My workflow is pretty straightforward: I want to do something, I either look in the menu or type a command to find it, and the computer does it. Or I'm on a website, I want to see what is on the page, and if I'm lucky, the headings are marked up so that I at least get a good idea of its structure. And if I want a page summary before I get started, my screen reader can do that at the press of a button. I don't see any benefit of AI here, with the obvious exceptions of text recognition and helping to map out an otherwise inaccessible window so that its characteristics can be sent to the screen reader, which could then read what the AI sent as I interact with the window normally. I don't want a detailed description of the whole window, only the control I'm focusing on at the time and any text that may need my attention when the window pops up. AI descriptions are still a bit too wordy, sometimes leading to additional confusion rather than a straightforward workflow.


Yes, I for one enjoy the graphical desktop and the consistency it provides: one key combination has the same function everywhere, instead of all these little programs having different key sequences that end up doing the same thing. For example, pressing q here closes the application, but pressing it in another application does nothing, because I was supposed to use control+x, which incidentally is the cut command *everywhere* on the MATE desktop, and on GNOME as well. I notice nothing slow about writing this message in Thunderbird, which used to be pretty darn slow just 10 or so years ago but is smooth now, and this is on a laptop that is about 8 years old and a desktop that is about 12. So I can see how, in the future, AI may become useful enough locally that it can be just as fast as interacting with a graphical desktop is now. But most AI models still rely too heavily on the GPU, so this may come at some point down the road; not, I fear, in the next year or so, but maybe in the next 10. Still, the problems of hallucination and its attempts to think for me remain a bit off-putting, even if the lag it would introduce now can be fixed.

~Kyle

_______________________________________________
This message was sent via the BRLTTY mailing list.
To post a message, send an e-mail to: [email protected]
For general information, go to: http://brltty.app/mailman/listinfo/brltty