Hi Pravin Ji,
This is what I have gathered from Language Technology Kendra (LTK),
PatanDhoka with regards to your query.

================================================================================
The database structure of online database has a pretty simple structure,
which was later redesigned mainly for quick information retrieval, by
generating formatted rich-text information from the relational database. The
rich-text information was generated using a script, which we used to run to
generate and dump the formatted content into the database. The formatted
content are formatted using HTML tags, hence they are only suited for the
design of the website. In order to reuse the database, the person, who is
willing to convert the database into SQLite database, needs the main
relational database of dictionary, and has to customize the script in order
to generate their own formatted text. Before running the tool, the database
tables need to be vacuumed. It might take, at least, one hour to generate
formatted content from the relational database. If the formatting of
information is not the issue, the online database, available for download,
can be converted into SQLite database without any problem.

The script used to generate formatted result is "indexer.php" which is made
available in the Sabdakos source, that is distributed with the Nepalinux DVD
distribution. The formatted result contains information from several
relational tables, and are dumped into two tables
"tbl_component_indv_entry_index" and "tbl_component_main_entry_index". The
table "tbl_component_indv_entry_index" depends on the information stored in
the table "tbl_component_main_entry_index".The
"tbl_component_main_entry_index" contains all the unique entries of root,
derived, inflected forms, along with phrases, idioms, compounds, and
proverbs. The type of record is distinguished by the fields
'component_indv_entry_index_comp_type'  and
'component_indv_entry_index_entry_type'. And, the table
'tbl_component_indv_entry_index' contains dictionary information of each
unique entry of the same word. Uniqueness of entry is caused by difference
in part-of-speech, guide word, or pragmatics. All the care is taken by the
script called "indexer.php" which pulls the records from the relational
database.
==============================================================================================

I also gathered that currently neither the source or the main relational
database is available for download in the Sabdakos website. We will look
into keeping all of these online in future. But for now, I would suggest you
to contact Mr. Dibyendra Hyoju and/or Mr. Vijay Shrestha (the technical
team) at LTK involved in the Sabdakos Project.

Regards,
Bal Krishna Bal
Language Technology Kendra
Lalitpur, PatanDhoka
Nepal

P.S Language Technology Kendra (LTK) is a recent collaboration between Madan
Puraskar Pustakalaya, Kathmandu University, Tribhuvan University. It's
office is located at the same building where Madan Puraskar Pustakalya
remains as of now.

On Fri, Jun 25, 2010 at 7:14 AM, C2 <[email protected]> wrote:

> I analyzed the tables in that database. It seems that this is not the
> updated database. If someone can create a list of nepali words then I
> can create a crawler which will query for the meaning of the words at
> nepalisabdakos.com and create a word-meaning database. Unless, there
> is already an updated database that we can get from them.
>
>
> On Jun 24, 6:03 am, pravin joshi <[email protected]> wrote:
> > Can someone explain to me the structure of the database forhttp://
> www.nepalisabdakos.com/index.php?option=db
> > I want to convert that to sqlite.
> > Pravin
> >
> > On Jun 22, 8:15 am, C2 <[email protected]> wrote:
> >
> > > If it is a Nepali dictionary that you want, then you could probably
> > > use the database available athttp://
> www.nepalisabdakos.com/index.php?option=db.
> >
> > > On Jun 20, 5:44 pm, pravin joshi <[email protected]> wrote:
> >
> > > > One comment : the topic of FOSS software initiation fizzles out very
> > > > quickly in this list, if you notice. I have seen this topic come up
> > > > every once in a while, but dies out so fast that one hardly gets time
> > > > to think. I think this topic has to be kept alive as long as
> possible,
> > > > so that after the initial crunch ideas begin to flow.
> > > > Laxmi Ji has been repeatedly looking at tools that will help people
> > > > for Nepali language related stuff. I like his ideas in the sense that
> > > > they are the tools that are required to allow adaptation of FOSS
> tools
> > > > in organizations fast, which should be our large target. Laxmi Ji,
> > > > here is a tool I propose, a tool similar to Artha for Nepali - Artha
> > > > is a free cross-platform English thesaurus that works completely off-
> > > > line and is based on WordNet. Its available athttp://
> artha.sourceforge.net/wiki/index.php/Home
> > > > If we can get hold of a softcopy of any of the Nepali dictionaries or
> > > > thesaurus that are available in the market, I am willing to putup the
> > > > time to program the 1st version of the software.
> > > > Pravin
> >
> > > > On Jun 20, 11:25 am, Laxmi Khatiwada <[email protected]> wrote:
> >
> > > > > Hi all
> >
> > > > > Nice to see the initiation about foss application. We can create
> English
> > > > > Nepali dictionary to embed apertium.
> > > > > It is for open source English Nepali translation. Apertium is an
> open source
> > > > > translation application + language hub.
> >
> > > > >
> http://wiki.apertium.org/wiki/Bilingual_dictionaryhttp://wiki.apertiu...
> >
> > > > > some tips for notation and reordering
> >
> > > > >http://wiki.apertium.org/wiki/Nepali_and_English
> >
> > > > > Can we collect some English Nepali word list in XML format?
> >
> > > > > On Sat, Jun 19, 2010 at 7:25 AM, C2 <[email protected]> wrote:
> > > > > > It's good to see people are interested in initiating FOSS
> software.
> > > > > > It's good to start with some of the already started projects and
> move
> > > > > > on to some new ones. Also, I think it's great to work with some
> > > > > > projects related to Nepali Language, but may be we could also
> start
> > > > > > some projects that has a wider audience.
> > > > > > Right now I'm thinking about forking some of the open-source
> projects
> > > > > > that are useful but has less or no developers supporting it. It
> would
> > > > > > be nice to start with something that has a code-base and some
> user-
> > > > > > base. Also, we could start e-Barnamala too. I think we should go
> with
> > > > > > the kind of application that will attract more no. of users.
> >
> > > > > > Could anyone confirm whether there is a Nepali to Unicode plugin
> for
> > > > > > Open Office ? I think that would be handy too.
> >
> > > > > > --
> > > > > > FOSS Nepal mailing list: [email protected]
> > > > > >http://groups.google.com/group/foss-nepal
> > > > > > To unsubscribe, e-mail: 
> > > > > > [email protected]<foss-nepal%[email protected]>
> <foss-nepal%[email protected]<foss-nepal%[email protected]>
> >
> >
> > > > > > Mailing List Guidelines:
> > > > > >http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
> > > > > > Community website:http://www.fossnepal.org/
>
> --
> FOSS Nepal mailing list: [email protected]
> http://groups.google.com/group/foss-nepal
> To unsubscribe, e-mail: 
> [email protected]<foss-nepal%[email protected]>
>
> Mailing List Guidelines:
> http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
> Community website: http://www.fossnepal.org/
>

-- 
FOSS Nepal mailing list: [email protected]
http://groups.google.com/group/foss-nepal
To unsubscribe, e-mail: [email protected]

Mailing List Guidelines: 
http://wiki.fossnepal.org/index.php?title=Mailing_List_Guidelines
Community website: http://www.fossnepal.org/

Reply via email to