Thanks C2, We need a corpus with 100M words. It has fewer words. Nevertheless its something than having nothing at all. To start we need a corpus with 10M words.
This corpus must have been provided by the Nepali community earlier and I am pretty much sure that someone has an updated corpus. I wished to get that corpus. Its a matter of time when the corpus will be prepared and then finally we might be able to use a MT for Nepali. On Thu, Aug 11, 2011 at 11:03 AM, C2 <[email protected]> wrote: > I think there is a Nepali corpus available at > > http://www.crulp.org/software/ling_resources/urdunepalienglishparallelcorpus.htm > . > > http://www.nltk.org/ is the natural language toolkit in python that > allows to POS tag english words and other features. > > Hope that helps. > > On Aug 10, 11:14 pm, Rajesh Pandey <[email protected]> wrote: > > Hello C2, > > Thanks for the reply and about the POS tagger, chunker and classifiers. > > Are there any links for them and how to get them? People have done lots > of > > work for Nepali.http://ltk.org.npdoes not have a link to download the > > source code / corpus for Nepali. The download link redirects to > > nlp.ku.edu.np which is a webpage where we can use it, that's fair, but I > > want to modify it test it, commit it again or may be have a local copy > for > > myself to use it offline. > > > > Is there a MT download link / a copy of MT that has been used in Nepali, > > link to corpus > > I am interested in getting a Machine translator and a corpus for Nepali. > > > > There are plans. I know people have already worked in various POS tagger, > > Chunkers, classifiers and have created the corpus. I don't want to do > > everything from the very beginning (reinventing the wheel). > > I would rather use the wheel to create a bicycle to move faster to the > > destination :) > > > > ----------- > > Its for the community, its for my satisfaction because I use Nepali as a > > second language when I use a computer. I can use a lot of tools in > English > > and the language "Nepali" has very less features, which is sad. > Technology > > has touched the sky and I feel like the "Nepali language" is at the stone > > age. I can't use a translator tool such as Google translator or babelfish > > which is freely available and that makes me crazy. > > > > If any one who's reading this email has any source code, download link to > > Nepali MT, Nepali Corpus, helpful resources, *please share*. I know the > > "Google Search" to search for the resources, the "Google Search" is not > > helping much, but I know the community as well whom I trust more. > > Thanks. > > > > On Wed, Aug 10, 2011 at 10:35 PM, C2 <[email protected]> wrote: > > > How about using Python ? It has lots of helpful language processing > > > toolkits, nltk comes to mind. A built in POS tagger, NP chunker, > > > Classifiers which could all be useful in building a translator. May I > > > ask if you have any plans to go beyond the rule based approach ? > > > > http://code.google.com/p/nepaliwikipediatranslator > > -- > > Rajesh Pandey > > -- > FOSS Nepal mailing list: [email protected] > http://groups.google.com/group/foss-nepal > To unsubscribe, e-mail: [email protected] > > Mailing List Guidelines: > http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines > Community website: http://www.fossnepal.org/ > -- Rajesh Pandey -- FOSS Nepal mailing list: [email protected] http://groups.google.com/group/foss-nepal To unsubscribe, e-mail: [email protected] Mailing List Guidelines: http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines Community website: http://www.fossnepal.org/
