URL:
<http://gna.org/task/?func=detailitem&item_id=3047>
Summary: Submission of Python bindings for the SFST library
Project: Gna! Administration
Submitted by: tarnold
Submitted on: Samstag 18.03.2006 um 19:29
Status: None
Approval Status: None
Should Start On: Samstag 18.03.2006 um 00:00
Should be Finished on: Dienstag 28.03.2006 um 00:00
Category: Project Approval
Priority: 5 - Normal
Privacy: Public
Assigned to: None
Open/Closed: Open
_______________________________________________________
Details:
A new project has been registered at Gna!
The project account will remain inactive until a site admin approve or
discard the registration.
######### REGISTRATION ADMINISTRATION #########
While this item will be useful to track the registration process, approving
or discarding the registration must be done using the specific "Group
Administration" page, accessible only to site administrators, effectively
logged as site administrators (superuser):
<https://gna.org/siteadmin/groupedit.php?group_id=1559>
######### REGISTRATION DETAILS #########
Full Name:
----------
Python bindings for the SFST library
System Group Name:
-----------------
pysfst
Type:
-----
Programs
License:
--------
GNU General Public License V2 or later
Description:
------------
The pysfst module contains Python bindings for the API of SFST, the
Stuttgart Finite State Transducer Tools. The source URL is
http://mypage.bluewin.ch/tarnold/pysfst/pysfst-0.0.1.tar.bz2
The interface was generated using SWIG and is included in the distribution.
The hardest part for now was to overload the stream oriented API of the C++
library with a more pythonic string list oriented one using pipes.
SFST itself (see dependencies below) comes with a german morphology called
SMOR. An excerpt from the doctests for pysfst is best suited to demonstrate
what pysfst/SFST is for:
This is the lexical example entry (the german name for the bird called
"bullfinch" in english) we'll use for now:
<Base_Stems>Dompfaff<NN><base><nativ><NMasc_es_en>
Often the result of analyze_string is ambiguous. In that case, analyze_string
returns any matching disambiguation in a list.
>>> smor_transducer.analyze_string('Dompfaff')
['Dompfaff<+NN><Masc><Dat><Sg>', 'Dompfaff<+NN><Masc><Nom><Sg>',
'Dompfaff<+NN><Masc><Akk><Sg>']
Word form generation is sometimes ambiguous, too. Here we generate our sample
bird in the genitive singular case which optionially allows the elision of an
'e'.
>>> smor_transducer.generate_string('Dompfaff<+NN><Masc><Gen><Sg>')
['Dompfaffs', 'Dompfaffes']
Other Software Required:
------------------------
Python: http://www.python.org
SWIG: http://www.swig.org
SFST: http://www.ims.uni-stuttgart.de/projekte/gramotron/SOFTWARE/SFST.html
"The SFST tools are available under the GNU Public License and can be
downloaded as a gzip-compressed tar archive."
Other Comments:
---------------
I did not yet inform Helmut Schmid, the author of SFST, about pysfst. I
planned to do it when/if the project page is up.
_______________________________________________________
Reply to this item at:
<http://gna.org/task/?func=detailitem&item_id=3047>
_______________________________________________
Nachricht geschickt von/durch Gna!
http://gna.org/
_______________________________________________
Register mailing list
[email protected]
http://mail.gna.org:8080/listinfo/register