Thanks C2,
We need a corpus with 100M words. It has fewer words. Nevertheless its
something than having nothing at all.
To start we need a corpus with 10M words.

This corpus must have been provided by the Nepali community earlier and I am
pretty much sure that someone has an updated corpus. I wished to get that
corpus. Its a matter of time when the corpus will be prepared and then
finally we might be able to use a MT for Nepali.



On Thu, Aug 11, 2011 at 11:03 AM, C2 <[email protected]> wrote:

> I think there is a Nepali corpus available at
>
> http://www.crulp.org/software/ling_resources/urdunepalienglishparallelcorpus.htm
> .
>
> http://www.nltk.org/ is the natural language toolkit in python that
> allows to POS tag english words and other features.
>
> Hope that helps.
>
> On Aug 10, 11:14 pm, Rajesh Pandey <[email protected]> wrote:
> > Hello C2,
> > Thanks for the reply and about the POS tagger, chunker and classifiers.
> > Are there any links for them and how to get them? People have done lots
> of
> > work for Nepali.http://ltk.org.npdoes not have a link to download the
> > source code / corpus for Nepali. The download link redirects to
> > nlp.ku.edu.np which is a webpage where we can use it, that's fair, but I
> > want to modify it test it, commit it again or may be have a local copy
> for
> > myself to use it offline.
> >
> > Is there a MT download link / a copy of MT that has been used in Nepali,
> > link to corpus
> > I am interested in getting a Machine translator and a corpus for Nepali.
> >
> > There are plans. I know people have already worked in various POS tagger,
> > Chunkers, classifiers and have created the corpus. I don't want to do
> > everything from the very beginning (reinventing the wheel).
> > I would rather use the wheel to create a bicycle to move faster to the
> > destination :)
> >
> > -----------
> > Its for the community, its for my satisfaction because I use Nepali as a
> > second language when I use a computer. I can use a lot of tools in
> English
> > and the language "Nepali" has very less features, which is sad.
> Technology
> > has touched the sky and I feel like the "Nepali language" is at the stone
> > age. I can't use a translator tool such as Google translator or babelfish
> > which is freely available and that makes me crazy.
> >
> > If any one who's reading this email has any source code, download link to
> > Nepali MT, Nepali Corpus, helpful resources, *please share*. I know the
> > "Google Search" to search for the resources, the "Google Search" is not
> > helping much,  but I know the community as well whom I trust more.
> > Thanks.
> >
> > On Wed, Aug 10, 2011 at 10:35 PM, C2 <[email protected]> wrote:
> > > How about using Python ? It has lots of helpful language processing
> > > toolkits, nltk comes to mind. A built in POS tagger, NP chunker,
> > > Classifiers which could all be useful in building a translator. May I
> > > ask if you have any plans to go beyond the rule based approach ?
> >
> > http://code.google.com/p/nepaliwikipediatranslator
> > --
> > Rajesh Pandey
>
> --
> FOSS Nepal mailing list: [email protected]
> http://groups.google.com/group/foss-nepal
> To unsubscribe, e-mail: [email protected]
>
> Mailing List Guidelines:
> http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
> Community website: http://www.fossnepal.org/
>



-- 
Rajesh Pandey

-- 
FOSS Nepal mailing list: [email protected]
http://groups.google.com/group/foss-nepal
To unsubscribe, e-mail: [email protected]

Mailing List Guidelines: 
http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
Community website: http://www.fossnepal.org/

Reply via email to