I don't do anything that mixes R and python so I can't comment on using the two interchangeably.
There are some api's that I've played around with that seem okay, but they probably buckle under a reasonable amount of preasure. On Tue, Dec 7, 2010 at 10:47 AM, Santosh Srinivas < [email protected]> wrote: > First ... thanks for data.table .... even I (with average R skills) am > able to do some cool stuff on large datasets and FAST!. > > I've heard that integrability with R is quite decent? > Python is next on my list but I hope my experience will be complementary > > > On Tue, Dec 7, 2010 at 10:10 PM, Rob Forler <[email protected]> wrote: > > It's not a hard language to pick up, but definitely one of the challenges > > with replacing R with python is many of the stat and data packages that > > exist in R. > > Python has strong scientific and data packages, but I not at the same > level. > > > > On the other hand python is much stronger from a language point of view > > (good oo etc), and has many more operational tools. > > > > -Rob > > > > On Tue, Dec 7, 2010 at 8:30 AM, Matthew Dowle <[email protected]> > > wrote: > >> > >> Interesting. I don't know python but it's been on the radar a few times. > >> > >> "Rob Forler" <[email protected]> wrote in message > >> news:[email protected]... > >> I'm coding in python now. The group I'm in now has a similar tool > (closed > >> source) to data.table but in python and is based on numpy. > >> > >> The api isn't as beautiful as data.table's, but has similar > functionality. > >> > >> -Rob > >> > >> On Tue, Dec 7, 2010 at 7:54 AM, Matthew Dowle <[email protected]> > >> wrote: > >>> > >>> Thanks Rob. That begs an obvious question then ... what are you coding > in > >>> now ? > >>> > >>> "Rob Forler" <[email protected]> wrote in message > >>> news:[email protected]... > >>> I can attest that I used data.table very extensively for several months > >>> on large datasets (financial). I was replacing a fair of poorly coded > >>> data/frame, sql, plyr, apply code, and was able to match the previous > >>> numbers and do a significant amount of new analysis because of the ease > of > >>> using data.tables. > >>> > >>> If I was still coding in R on a regular basis you can gauarauntee I'd > use > >>> data.table every day. > >>> > >>> Thanks, > >>> Rob > >>> > >>> On Tue, Dec 7, 2010 at 7:30 AM, Matthew Dowle <[email protected]> > >>> wrote: > >>>> > >>>> Just to clarify also about the date of first release - March 2010? Any > >>>> chance Mel you looked at the CRAN archive page and read off the last > >>>> row? > >>>> Oldest is first not last on that page : > >>>> > >>>> http://cran.r-project.org/src/contrib/Archive/data.table/ > >>>> > >>>> v1.0 was released April 2006 but that was removed from CRAN happily > >>>> because > >>>> base quickly (within weeks) included features that removed the need > for > >>>> data.table. It was re-released in Aug 2008 with new functionality so > >>>> that's > >>>> the relevant release date for your purpose. > >>>> > >>>> Feel free to post the puzzling results. You've done well to use it for > 2 > >>>> weeks without posting, so you can probably tilt towards using this > list > >>>> more > >>>> (on a new thread please). If we can get you over those hurdles first > >>>> then > >>>> reconsider if the 'robustness' question still stands. > >>>> > >>>> Other info which you may have not have found yet ... > >>>> > >>>> Crantastic has 5 detailed user reviews of data.table. It does state > that > >>>> v1.1 was released over 2 years ago, too, so leads me to guess you may > >>>> have > >>>> missed the link to crantastic on the data.table homepage. > >>>> > >>>> There are some oddities in the ranking formula but if you look at > >>>> http://crantastic.org/popcon and realise that the batch near the > bottom > >>>> starting with reshape, ggplot2 and plyr should be at the top (seems > like > >>>> a > >>>> bug, I'll let them know) then data.table appears to be around the 8th > >>>> most > >>>> popular CRAN package with average score 4.7/5 and 10 users, compared > to > >>>> ggplot2's 39 users. So crantastic itself is not popular since > everyone > >>>> knows that ggplot2 has many more than 39 users, and some very popular > >>>> and > >>>> stable packages don't have any votes at all. Even so perhaps this > small > >>>> amount of data may be useful in your assessment generally. > "data.table" > >>>> is > >>>> not the easiest to google for. > >>>> > >>>> The NEWS file (link on the homepage) says that v1.2 was released in > Aug > >>>> 2008, too, at the bottom, along with what changed in each release > since > >>>> then. > >>>> > >>>> Matthew > >>>> > >>>> "Tom Short" <[email protected]> wrote in message > >>>> news:[email protected]... > >>>> > On Mon, Dec 6, 2010 at 10:54 PM, mbacou <[email protected]> wrote: > >>>> >> My question is: is data.table ready for production? Would you rely > on > >>>> >> it > >>>> >> for > >>>> >> sensitive publications? > >>>> > > >>>> > If you have tight time deadlines, you may want to go with what you > >>>> > have experience with, especially if it involves complicated queries > or > >>>> > manipulations. If you've already tried the data.table features > you'll > >>>> > need for "production", then using data.table may help you get things > >>>> > done faster. > >>>> > > >>>> > Data.table has been robust for me on 6-GB datasets on a machine with > >>>> > 24 GB of ram. With data.table, as with most tools, user error is > more > >>>> > likely than a tool bug, so you need to test/check your data and your > >>>> > results. > >>>> > > >>>> > - Tom > >>>> > >>>> > >>>> > >>>> _______________________________________________ > >>>> datatable-help mailing list > >>>> [email protected] > >>>> > >>>> > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > >>> > >>> ________________________________ > >>> > >>> _______________________________________________ > >>> datatable-help mailing list > >>> [email protected] > >>> > >>> > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > >>> > >>> _______________________________________________ > >>> datatable-help mailing list > >>> [email protected] > >>> > >>> > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > >>> > >> > >> ________________________________ > >> > >> _______________________________________________ > >> datatable-help mailing list > >> [email protected] > >> > >> > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > >> > >> _______________________________________________ > >> datatable-help mailing list > >> [email protected] > >> > >> > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > >> > > > > > > _______________________________________________ > > datatable-help mailing list > > [email protected] > > > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help > > > > > _______________________________________________ > datatable-help mailing list > [email protected] > https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help >
_______________________________________________ datatable-help mailing list [email protected] https://lists.r-forge.r-project.org/cgi-bin/mailman/listinfo/datatable-help
