I am starting to think about my own text-oriented workshop for social scientists that would use Gigantum to help support beginners thinking more about big-picture aspects of data projects. From there, we'd transition to explain how you could do similar things "by hand" (e.g. using git + LFS directly).
Ziyaad did you ever create an NLP working group? In any case, I'm happy to contribute to core carpentry-style materials (and I can easily adapt them to my own purposes later). And of course, if anyone wants to do stuff with Gigantum, I'm excited to help with that! But my sense is that this community is pretty committed to the command line ;) In terms of concepts, I'm interested in things that will help learners grok general ideas. So, understanding the transformation of texts to a matrix of frequency counts, perhaps some basics of what a matrix is, and what you can do with linear systems, and perhaps also the idea of an abstract "space" like word2vec. And as far as tools, I'm happy to do R or Python (and use RStudio or Jupyter). I see one vote for Spacy - so if that's a way to get someone on-board that sounds good to me! I've not done NLP for a few years, and spacy looks like magical python easy (cf. XKCD <https://xkcd.com/353/>). Best, Dav On Fri, Jul 19, 2019 at 9:11 AM <[email protected]> wrote: > Hi all, > > just to let you know ... I'm planning to prepare a carpentries style > course on text mining... which is a type of NLP (i.e. Natural Language > Processing). :-) > > Bea Alex (Edinburgh) > *The Carpentries <https://carpentries.topicbox.com/latest>* / discuss / > see discussions <https://carpentries.topicbox.com/groups/discuss> + > participants <https://carpentries.topicbox.com/groups/discuss/members> + > delivery > options <https://carpentries.topicbox.com/groups/discuss/subscription> > Permalink > <https://carpentries.topicbox.com/groups/discuss/Tee8f460df4206334-M6ae10f32741988f2203ef37e> > ------------------------------------------ The Carpentries: discuss Permalink: https://carpentries.topicbox.com/groups/discuss/Tee8f460df4206334-Mb454b7276b0ab684e86d462c Delivery options: https://carpentries.topicbox.com/groups/discuss/subscription
