I first learned about SQLite in the Bioconductor branch of R. I figured if they could handle massive genetic databases in SQLite, SQLite ought to be able to handle a million (or even 12 million) voters in a voter file.
Here is a brief article from 2006, "How to Use SQLite with R" by Seth Falcon. http://master.bioconductor.org/help/course-materials/2006/rforbioinformatics/labs/thurs/SQLite-R-howto.pdf Jim On Thu, Feb 19, 2015 at 2:08 PM, Jim Callahan < jim.callahan.orlando at gmail.com> wrote: > Strongly agree with using the R package Sqldf. > I used both RSQLite and Sqldf, both worked extremely well (and I am both a > lazy and picky end user). Sqldf had the advantage that it took you all the > way to your destination the workhorse R object the data frame (R can define > new objects, but the data frame as an in memory table is the default). > The SQLITE3 command line interface and the R command line had a nice > synergy; SQL was great for getting a subset of rows and columns or building > a complex view from multiple tables. Both RSqlite and Sqldf could > understand the query/view as a table and all looping in both SQL and R took > place behind the scenes in compiled code. > > Smart phone users say "there is an app for that". R users would say "there > is a package for that" and CRAN is the equivalent of the Apple app store or > Google Play. > > R has packages for graphics, classical statistics, Bayesian statistics and > machine learning. R also has packages for spacial statistics (including > reading ESRI shapefiles), for graph theory and for building decision trees. > There is another whole app store for biological applications "bioconductor". > > The CRAN website has "views" (pages or blogs) showing how packages solve > common problems in a variety of academic disciplines or application areas. > > Jim Callahan > On Feb 19, 2015 11:38 AM, "Gabor Grothendieck" <ggrothendieck at gmail.com> > wrote: > >> On Wed, Feb 18, 2015 at 9:53 AM, Richard Hipp <drh at sqlite.org> wrote: >> > On 2/18/15, Jim Callahan <jim.callahan.orlando at gmail.com> wrote: >> >> I would mention the open source statistical language R in the "data >> >> analysis" section. >> > >> > I've heard of R but never tried to use it myself. Is an SQLite >> > interface built into R, sure enough? Or is that something that has to >> > be added in separately? >> > >> >> RSQLite is an add-on package to R; however, for data analysis (as >> opposed to specific database manipulation) I would think most R users >> would use my sqldf R add-on package (which uses RSQLite by default and >> also can use driver packages of certain other databases) rather than >> RSQLite directly if they were going to use SQL for that. >> >> In R a data.frame is like an SQL table but in memory and sqldf lets >> you apply SQL statements to them as if they were all one big SQLite >> database. A common misconception is it must be slow but in fact its >> sufficiently fast that some people use it to get a speed advantage >> over plain R. Others use it to learn SQL or to ease the transition to >> R and others use it allow them to manipulate R data frames without >> knowing much about R provided they know SQL. >> >> If you have not tried R this takes you through installing R and >> running sqldf in about 5 minutes: >> https://sqldf.googlecode.com/#For_Those_New_to_R >> >> The rest of that page gives many other examples. >> _______________________________________________ >> sqlite-users mailing list >> sqlite-users at mailinglists.sqlite.org >> http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users >> >