Matt said: The overview claims to be able to convert natural language sentences into Cycl assertions, and to convert questions to Cycl queries. So I wonder why the knowledge base is still not being built this way. And I wonder why there is no public demo of the interface, and no papers giving verifiable experimental results.
>From my employment at Cycorp, 1999-2006, I can answer your questions, and also >say why I am pursuing a natural language approach with Texai, extending the >OpenCyc ontology. First understand that Doug Lenat is a mathematician, not a >linguist nor cognitive psychologist. The Cyc Project began, in the 1980's, >hand-entering knowledge in a predicate calculus schema that was intentionally >agnostic with respect to natural languages. To this day, increasingly >sophisticated tools have enabled Cycorp's ontologists to more precisely and >more rapidly enter knowledge and perform queries, than they can by relying on >rather incomplete and relatively poorly performing English language tools. >For example, I witnessed some very long parsing times for input sentences over >15 words. In contrast, new concepts can be defined and positioned in the >existing ontology using point and click, non-NL screens very rapidly if the >ontologist is fully prepared with respect to what they want to accomplish - for example using the "Create Similar" tool. Addressing the public demo, I believe that first Cycorp does not want to expend the considerable effort to create and maintain a publicly accessible NL interface of high quality, when there are so many other areas of Cyc that sponsors are paying for that need attention. Secondly, I believe that because Cyc is proprietary, it precludes a public NL interface that enables the proprietary content to be extracted. Furthermore, the effort required to create some reasonable, but small example public partition would be prevented by my first observation. During my tenure, Cycorp had no fewer than three PhD computational linguists continuously employed on NL interfaces. But I believe their progress has been blunted by the need to maintain a large number of legacy parsers, all of which have some dead-end (i.e. not cognitively plausible) characteristics. Furthermore, the Cyc NL parsing and NL generation systems are at least two completely different bodies of code. Therefore Cyc is not capable of understanding all of what it can say and vice versa. Moreover, Cyc's NL system, similar to its other long-standing code components, is quite large, and sadly demanding that the great majority of the developer's time is spent maintaining, fixing, migrating, rewriting and tailoring the existing code rather than adding new functionality. In previous posts here and on my blog, I have described the Texai system as an English dialog system to achieve AGI via bootstrapping a small code base. By extending OpenCyc's ontology, and in particular biasing it towards the semantics of English language constructions, I hope to avoid some of the problems I saw at Cycorp. Matt also said: It seems to me the main limitation is that the language model has to be described formally in Cycl, as a lexicon and rules for parsing and disambiguation.t seems to me the main limitation is that the language model has to be described formally in Cycl, as a lexicon and rules for parsing and disambiguation. Agreed. For the Texai language model, I employ Fluid Construction Grammar as the encoding, and Double-R Theory for its grammatical constructs. Although I currently hand-write these in an symbolic-expression external format, they are serialized into RDF from corresponding Java rule objects, and stored in the Texai KB as any other assertion. My plan is to task the dialog system first to interact with its mentors to acquire new vocabulary (e.g. mappings from word senses to Cyc concepts, argument mappings to event roles, etc.) and new grammar constructions (e.g. "on the table" as a phase can have as one of its senses an instance of cyc:Negotiating). Those wanting to know more about Cyc should attend my Cyc tutorial at AGI-09. Or you can download OpenCyc whose first release I lobbied for, and then created with John DeOliveira while at Cycorp. -Steve Stephen L. Reed Artificial Intelligence Researcher http://texai.org/blog http://texai.org 3008 Oak Crest Ave. Austin, Texas, USA 78704 512.791.7860 ----- Original Message ---- From: Matt Mahoney <[EMAIL PROTECTED]> To: [email protected] Sent: Sunday, September 28, 2008 8:38:36 PM Subject: Re: [agi] universal logical form for natural language --- On Sun, 9/28/08, Ben Goertzel <[EMAIL PROTECTED]> wrote: >FYI, Cyc has a natural language front end and a lot of folks have been >>working on it for the last 5+ years... It still needs work. I found this undated (2004 or later) white paper which is apparently not linked from cyc.com. http://www.cyc.com/doc/white_papers/KRAQ2005.pdf And also this overview. http://www.cyc.com/cyc/cycrandd/areasofrandd_dir/nlu The overview claims to be able to convert natural language sentences into Cycl assertions, and to convert questions to Cycl queries. So I wonder why the knowledge base is still not being built this way. And I wonder why there is no public demo of the interface, and no papers giving verifiable experimental results. It seems to me the main limitation is that the language model has to be described formally in Cycl, as a lexicon and rules for parsing and disambiguation. There seems to be no mechanism for learning natural language by example. For example, if Cyc receives a sentence it cannot parse, or is ambiguous, or has a word not in its vocabulary or used in a different way, then there is no mechanism to update the model, which is something humans easily do. Given the complexity of English, I think this is a serious limitation with no easy solution. -- Matt Mahoney, [EMAIL PROTECTED] ------------------------------------------- agi Archives: https://www.listbox.com/member/archive/303/=now RSS Feed: https://www.listbox.com/member/archive/rss/303/ Modify Your Subscription: https://www.listbox.com/member/?& Powered by Listbox: http://www.listbox.com ------------------------------------------- agi Archives: https://www.listbox.com/member/archive/303/=now RSS Feed: https://www.listbox.com/member/archive/rss/303/ Modify Your Subscription: https://www.listbox.com/member/?member_id=8660244&id_secret=114414975-3c8e69 Powered by Listbox: http://www.listbox.com
