Text literally is vision, a sequence of objects/ frames. When you look across a map, that too is a frame change. You can only see a single feature at a time, be it a page full of text or a word in hat text. Text was made to be efficient and is standalone holding all the information you need. And a neural network doesn't care about which type of data, the way it "works" fundamentally is nearly if not exactly identical and OpenAI uses theirs on vision, text, music.
Our sub conscious makes decisions, like typo recognition similar word and position recognition, it does this very fast without "ponder of thoughts" and doesn't wait (it may even make you have sex or avoid a car) and just enough to make an O-K prediction and never can reach "100% is sure" since harder the more % you seek. Different prompts/ questions/ problems have different thresholds. Conscious decision making takes longer, we are searching possible paths of sequences, and we wait, stop, repeat paths, hold onto them, then finally settle on an answer after some time threshold. ------------------------------------------ Artificial General Intelligence List: AGI Permalink: https://agi.topicbox.com/groups/agi/T224cb80d0cc8b0e7-M147e91f677cdd70eecefbbe3 Delivery options: https://agi.topicbox.com/groups/agi/subscription
