Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
>> Well if you were prepared to type a search for >> computational linguistics software into google, you would >> find several free tools available for linux listed on pages >> such as >> >> https://martinweisser.org/corpora_site/comp_ling_resources.html > > Indeed, that page has 4 hits for Unix

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
debian-user wrote: > Well if you were prepared to type a search for computational > linguistics software into google, you would find several > free tools available for linux listed on pages such as > > https://martinweisser.org/corpora_site/comp_ling_resources.html Indeed, that page has 4 hits

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread debian-user
Emanuel Berg wrote: > Nicholas Geovanis wrote: > > > Those books teach and discuss some of the software that's > > used. I doubt you will find them in debian's repositories. > > Of course you can do plenty of computational linguistics > > with perl or python which you already have. > > > > What

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
>> A basic search finds this web tool: >> >> https://www.usingenglish.com/resources/text-statistics/ > > I didn't get it to work in Emacs-w3m, be it lack of JavaScript > support or something else. Anyway the page and tool claims to > do this: > > Total Word Count > Total Word Count (Excluding

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: > Those books teach and discuss some of the software that's > used. I doubt you will find them in debian's repositories. > Of course you can do plenty of computational linguistics > with perl or python which you already have. > > What is a "regular expression" which is at

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Nicholas Geovanis
On Fri, Jun 30, 2023, 10:32 AM Emanuel Berg wrote: > Nicholas Geovanis wrote: > > > If you have python programming skills, you might > > consider NLTK > > Unbelievable if there are no such tools anywhere already, > but I don't have one either so maybe there aren't then? >

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Joel Roth wrote: > A basic search finds this web tool: > > https://www.usingenglish.com/resources/text-statistics/ I didn't get it to work in Emacs-w3m, be it lack of JavaScript support or something else. Anyway the page and tool claims to do this: Total Word Count Total Word Count

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: > If you have python programming skills, you might > consider NLTK Unbelievable if there are no such tools anywhere already, but I don't have one either so maybe there aren't then? >>> >>> There's a big subject called computational linguistics. >>>

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Nicholas Geovanis
On Fri, Jun 30, 2023, 8:32 AM Emanuel Berg wrote: > Nicholas Geovanis wrote: > > >>> If you have python programming skills, you might consider > >>> NLTK > >> > >> Unbelievable if there are no such tools anywhere already, > >> but I don't have one either so maybe there aren't then? > >> > > > >

Re: FOSS tool to do general stats from text indata

2023-06-30 Thread Emanuel Berg
Nicholas Geovanis wrote: >>> If you have python programming skills, you might consider >>> NLTK >> >> Unbelievable if there are no such tools anywhere already, >> but I don't have one either so maybe there aren't then? >> > > There's a big subject called computational linguistics. > They have

Re: FOSS tool to do general stats from text indata

2023-06-28 Thread Nicholas Geovanis
On Sat, Jun 24, 2023, 3:04 PM Emanuel Berg wrote: > Cousin Stanley wrote: > > > If you have python programming skills, you might consider > > NLTK > > Unbelievable if there are no such tools anywhere already, but > I don't have one either so maybe there aren't then? > There's a big subject

Re: FOSS tool to do general stats from text indata

2023-06-28 Thread Emanuel Berg
dvalin wrote: > As "stats" is a grab bag larger inside than the Tardis, > I suspect that only on that other ship with the infinite > improbability drive is a stats babelfish interpreter to be > found. For the last 30+ years, I've just thrown together > a few lines of Awk to generate the initially

Re: FOSS tool to do general stats from text indata

2023-06-25 Thread tomas
On Sun, Jun 25, 2023 at 08:28:05AM +0200, Emanuel Berg wrote: > tomas wrote: > > I mean a general tool, but with options to tweak the > report included, of course. > >>> > >>> If you can bear some tweaking, R is it. > >> > >> Sure! Let's run R on this e-mail. Does it work and if so,

Re: FOSS tool to do general stats from text indata

2023-06-25 Thread Emanuel Berg
tomas wrote: I mean a general tool, but with options to tweak the report included, of course. >>> >>> If you can bear some tweaking, R is it. >> >> Sure! Let's run R on this e-mail. Does it work and if so, what >> does it say? > > To a generic question -- a generic answer R is a

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread tomas
On Sat, Jun 24, 2023 at 10:00:05PM +0200, Emanuel Berg wrote: > tomas wrote: > > >> Is there a CLI and FOSS tool that creates stats from text > >> indata - e.g., > >> > >> $ txt2stats path/to/indata/*.txt > >> > >> I mean a general tool, but with options to tweak the report > >> included, of

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread John Hasler
Emanuel Berg writes: > Sure! Let's run R on this e-mail. Does it work and if so, what > does it say? Run 'apt-cache show r-base'. You will want to look at all the 'r-cran' packages for one that does what you need. -- John Hasler j...@sugarbit.com Elmwood, WI USA

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
tomas wrote: >> Is there a CLI and FOSS tool that creates stats from text >> indata - e.g., >> >> $ txt2stats path/to/indata/*.txt >> >> I mean a general tool, but with options to tweak the report >> included, of course. > > If you can bear some tweaking, R is it. Sure! Let's run R on this

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
Cousin Stanley wrote: > If you have python programming skills, you might consider > NLTK Unbelievable if there are no such tools anywhere already, but I don't have one either so maybe there aren't then? -- underground experts united https://dataswamp.org/~incal
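
For anyone reading the archive later: in the absence of a ready-made tool, here is a minimal sketch of what the hypothetical txt2stats from the original question might look like, using only the Python standard library rather than NLTK. The script name, the function text_stats, the choice of statistics and the definition of a "word" (a run of word characters, optionally with inner apostrophes) are all assumptions made for illustration, not something anyone in the thread proposed.

```python
#!/usr/bin/env python3
"""txt2stats: hypothetical sketch of a general text-statistics CLI."""
import re
import sys
from collections import Counter

# Assumption: a "word" is a run of word characters, optionally joined by
# apostrophes (so "don't" counts as one word). Note that \w also matches "_".
WORD_RE = re.compile(r"\w+(?:'\w+)*")


def text_stats(text, top=5):
    """Return a dict of basic statistics for a string of text."""
    words = [w.lower() for w in WORD_RE.findall(text)]
    counts = Counter(words)
    total = len(words)
    return {
        "lines": len(text.splitlines()),
        "words": total,
        "unique_words": len(counts),
        "chars": len(text),
        # Guard against division by zero on empty input.
        "avg_word_len": sum(len(w) for w in words) / total if total else 0.0,
        "top_words": counts.most_common(top),
    }


def main(paths):
    """Print one small report per input file."""
    for path in paths:
        with open(path, encoding="utf-8", errors="replace") as fh:
            stats = text_stats(fh.read())
        print(path)
        for key, value in stats.items():
            print(f"  {key}: {value}")


if __name__ == "__main__":
    main(sys.argv[1:])
```

Saved as txt2stats and made executable, it would be called the way the question imagined, e.g. $ txt2stats path/to/indata/*.txt, since the shell expands the glob before the script sees it. Anything beyond simple counts (tagging, collocations, figures) is where NLTK or R would presumably take over.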

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
Joel Roth wrote: > A basic search finds this web tool: > > https://www.usingenglish.com/resources/text-statistics/ Cool, I'll get back to you when I've tried it, God willing ... > Otherwise, I think you'll have to write your own -- or hire > someone (like me :^) to write one for you. Surely there

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Emanuel Berg
paulf wrote: >>> I don't know about all of your wishlist, but gnuplot is >>> the proper tool for taking data from, say, a CSV file, and >>> putting it into graphs of various types. >> >> Well, gnuplot is great obviously but is more a tool to >> visualize data, organized data, here we need a tool

Re: FOSS tool to do general stats from text indata

2023-06-24 Thread Cousin Stanley
On 2023-06-23 13:30, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats, maybe even figures, and

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread tomas
On Fri, Jun 23, 2023 at 10:20:50PM +0200, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. If you can bear some tweaking,

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread Joel Roth
On Fri, Jun 23, 2023 at 10:20:50PM +0200, Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats,

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread paulf
On Fri, 23 Jun 2023 23:05:10 +0200 Emanuel Berg wrote: > paulf wrote: > > > I don't know about all of your wishlist, but gnuplot is the > > proper tool for taking data from, say, a CSV file, and > > putting it into graphs of various types. > > Well, gnuplot is great obviously but is more a

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread Emanuel Berg
paulf wrote: > I don't know about all of your wishlist, but gnuplot is the > proper tool for taking data from, say, a CSV file, and > putting it into graphs of various types. Well, gnuplot is great obviously but is more a tool to visualize data, organized data, here we need a tool to analyze and

Re: FOSS tool to do general stats from text indata

2023-06-23 Thread paulf
On Fri, 23 Jun 2023 22:20:50 +0200 Emanuel Berg wrote: > Is there a CLI and FOSS tool that creates stats from text > indata - e.g., > > $ txt2stats path/to/indata/*.txt > > I mean a general tool, but with options to tweak the report > included, of course. > > To produce neat stats, maybe