Hi Chen, Sorry this should have went to the Tika lists, my bad!
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ On 5/18/16, 11:33 PM, "Chen Li" <che...@gmail.com> wrote: >Just curious, how is this task related to AsterixDB? > > > >On Wed, May 18, 2016 at 8:57 AM, Mattmann, Chris A (3980) < >chris.a.mattm...@jpl.nasa.gov> wrote: > >> Hi Everyone, >> >> Anastasija and I met this morning. Here are her next steps: >> >> >> 0. Completed learning, installing and using GeoTopicParser in Apache Tika >> 1. Learning about Movie Review Dataset (labeled data, yay!) >> 2. Try and build OpeNNLP model for that >> >> She and I will meet again next week and report progress. >> >> Cheers, >> Chris >> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> Director, Information Retrieval and Data Science Group (IRDS) >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> WWW: http://irds.usc.edu/ >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >> >> >> >> >> >> >> >> >> On 4/26/16, 12:23 PM, "Rodrigo Agerri" <rodrigo.age...@ehu.eus> wrote: >> >> >Hello, >> > >> >Everything looks very interesting. Other options are the Aspect Based >> >Sentiment Analysis tasks as described in >> > >> >http://alt.qcri.org/semeval2014/task4/ >> >http://alt.qcri.org/semeval2015/task12/ >> >http://alt.qcri.org/semeval2016/task5/ >> > >> >The task is well circumscribed plus data is publicly available, which >> >is good to try and make manageable objectives for a GSOC. >> > >> >Best, >> > >> >Rodrigo >> > >> > >> > >> >On Tue, Apr 26, 2016 at 6:10 PM, Anthony Beylerian >> ><anthony.beyler...@gmail.com> wrote: >> >> Please check this approach [1] it could be useful to combine >> >> a labeled seed set with unlabeled Fisher CallHome. >> >> Since it maybe a long read there's a shorter ppt as well [2] >> >> >> >> [1] link.springer.com/article/10.1023%2FA%3A1007692713085 >> >> [2] cseweb.ucsd.edu/~atsmith/presentation_final.ppt >> >> >> >> >> >> On Tue, Apr 26, 2016 at 11:36 PM, Joern Kottmann <kottm...@gmail.com> >> wrote: >> >> >> >>> The Large Movie Review Dataset might be interesting for this as well: >> >>> http://ai.stanford.edu/~amaas/data/sentiment/ >> >>> >> >>> Jörn >> >>> >> >>> On Tue, Apr 26, 2016 at 4:26 PM, Anthony Beylerian < >> >>> anthony.beyler...@gmail.com> wrote: >> >>> >> >>> > sentiment analysis discussion doc : >> >>> > >> >>> > >> >>> > >> >>> >> https://docs.google.com/document/d/1Gi59YqtisY4NLaVY3B7CNLMTgCRZm9JEk17kmBmWXqQ/edit?usp=sharing >> >>> > >> >>> > On Tue, Apr 26, 2016 at 10:56 PM, Mattmann, Chris A (3980) < >> >>> > chris.a.mattm...@jpl.nasa.gov> wrote: >> >>> > >> >>> > > Hi, >> >>> > > >> >>> > > Sure here is the link: >> >>> > > >> >>> > > https://hangouts.google.com/call/a2w5cgdtirf6jgfb4ww5l2l64ee >> >>> > > >> >>> > > Sorry for the delay. >> >>> > > >> >>> > > Cheers, >> >>> > > Chris >> >>> > > >> >>> > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > Chris Mattmann, Ph.D. >> >>> > > Chief Architect >> >>> > > Instrument Software and Science Data Systems Section (398) >> >>> > > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >>> > > Office: 168-519, Mailstop: 168-527 >> >>> > > Email: chris.a.mattm...@nasa.gov >> >>> > > WWW: http://sunset.usc.edu/~mattmann/ >> >>> > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > Director, Information Retrieval and Data Science Group (IRDS) >> >>> > > Adjunct Associate Professor, Computer Science Department >> >>> > > University of Southern California, Los Angeles, CA 90089 USA >> >>> > > WWW: http://irds.usc.edu/ >> >>> > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > >> >>> > > On 4/26/16, 6:48 AM, "Anastasija Mensikova" < >> >>> > > mensikova.anastas...@gmail.com> wrote: >> >>> > > >> >>> > > >Hi everyone, >> >>> > > > >> >>> > > > >> >>> > > >Is the 9:40 ET hangout still happening? I just have to leave soon >> to >> >>> go >> >>> > > to class. >> >>> > > > >> >>> > > > >> >>> > > >Thank you, >> >>> > > >Anastasija >> >>> > > > >> >>> > > > >> >>> > > >On 25 April 2016 at 23:39, Anastasija Mensikova >> >>> > > ><mensikova.anastas...@gmail.com> wrote: >> >>> > > > >> >>> > > >Hi Chris, >> >>> > > > >> >>> > > > >> >>> > > >Yes, that's perfect. I'll be ready by 9:40am. >> >>> > > > >> >>> > > > >> >>> > > >Thank you, >> >>> > > >Anastasija >> >>> > > > >> >>> > > > >> >>> > > >On 25 April 2016 at 23:28, Mattmann, Chris A (3980) >> >>> > > ><chris.a.mattm...@jpl.nasa.gov> wrote: >> >>> > > > >> >>> > > >Hey Anastasija, >> >>> > > > >> >>> > > >To be honest 9am EST is a little aggressive, I will likely be able >> >>> > > >to do 6:40 am PT (am traveling back from DC as I type this) which >> >>> > > >is 9:40am ET. >> >>> > > > >> >>> > > >My GChat handle is chris.mattm...@gmail.com. I will create a >> hangout >> >>> > > >and send to the list please contact me at 6:40am PT. >> >>> > > > >> >>> > > >Cheers, >> >>> > > >Chris >> >>> > > > >> >>> > > >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >Chris Mattmann, Ph.D. >> >>> > > >Chief Architect >> >>> > > >Instrument Software and Science Data Systems Section (398) >> >>> > > >NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >>> > > >Office: 168-519, Mailstop: 168-527 >> >>> > > >Email: chris.a.mattm...@nasa.gov >> >>> > > >WWW: >> >>> > > >http://sunset.usc.edu/~mattmann/ < >> http://sunset.usc.edu/~mattmann/> >> >>> > > >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >Director, Information Retrieval and Data Science Group (IRDS) >> >>> > > >Adjunct Associate Professor, Computer Science Department >> >>> > > >University of Southern California, Los Angeles, CA 90089 USA >> >>> > > >WWW: http://irds.usc.edu/ >> >>> > > >++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > >On 4/25/16, 11:07 PM, "Anastasija Mensikova" < >> >>> > > mensikova.anastas...@gmail.com> wrote: >> >>> > > > >> >>> > > >>Hi everyone, >> >>> > > >> >> >>> > > >> >> >>> > > >>So is the hangout session tomorrow (Tuesday) at 6:30pm IST (9am >> EST) >> >>> > > confirmed or not? >> >>> > > >> >> >>> > > >> >> >>> > > >>Thank you, >> >>> > > >>Anastasija >> >>> > > >> >> >>> > > >> >> >>> > > >>On 25 April 2016 at 15:23, Madhawa Kasun Gunasekara >> >>> > > >><madhaw...@gmail.com> wrote: >> >>> > > >> >> >>> > > >>Hi all, >> >>> > > >> >> >>> > > >> >> >>> > > >>Shall we have the hangout session tomorrow (Tuesday) about 18:30 >> IST >> >>> ? >> >>> > > >> >> >>> > > >> >> >>> > > >>Thanks, >> >>> > > >> >> >>> > > >>Madhawa >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >>Madhawa >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >>On Sun, Apr 24, 2016 at 10:33 PM, Mondher Bouazizi >> >>> > > >><mondher.bouaz...@gmail.com> wrote: >> >>> > > >> >> >>> > > >>Hi, >> >>> > > >> >> >>> > > >>I am sorry for my late reply. >> >>> > > >> >> >>> > > >>Given the time difference between Japan and USA, I think I won't >> be >> >>> > > >>available on weekdays. I will be available only on >> Friday/Saturday >> >>> > > morning >> >>> > > >>(9-10am EST). >> >>> > > >> >> >>> > > >>I am not sure if Chris is OK with that, we had our previous >> meetings >> >>> on >> >>> > > >>Saturday mornings. >> >>> > > >> >> >>> > > >>Otherwise, please go ahead. I will join as soon as I can. >> >>> > > >> >> >>> > > >>Thanks. >> >>> > > >> >> >>> > > >>@Chris: my github ID is mondher-bouazizi >> >>> > > >> >> >>> > > >>Best regards, >> >>> > > >> >> >>> > > >>Mondher >> >>> > > >> >> >>> > > >>On Mon, Apr 25, 2016 at 1:44 AM, Anastasija Mensikova < >> >>> > > >>mensikova.anastas...@gmail.com> wrote: >> >>> > > >> >> >>> > > >>> Hi Anthony, >> >>> > > >>> >> >>> > > >>> I can make it by Madhawa's proposal too, after 6pm IST on >> Tuesday >> >>> > > (after >> >>> > > >>> 8:30am EST). Let me know when exactly! >> >>> > > >>> >> >>> > > >>> Thank you, >> >>> > > >>> Anastasija >> >>> > > >>> >> >>> > > >>> On 24 April 2016 at 03:02, Anthony Beylerian < >> >>> > > anthony.beyler...@gmail.com> >> >>> > > >>> wrote: >> >>> > > >>> >> >>> > > >>>> Hi Anastasija, >> >>> > > >>>> >> >>> > > >>>> I'm not available by those times (00-07 JST). I could make >> it by >> >>> > > >>>> Madhawa's proposal, but otherwise please go ahead, we may >> discuss >> >>> > some >> >>> > > >>>> other time. >> >>> > > >>>> >> >>> > > >>>> @Chris: github ID : beylerian >> >>> > > >>>> >> >>> > > >>>> Best, >> >>> > > >>>> >> >>> > > >>>> Anthony >> >>> > > >>>> >> >>> > > >>>> >> >>> > > >>>> Please find my github profile >> >>> > > > >> >>> > > > >> >>> > > >>https://github.com/madhawa-gunasekara < >> >>> > > https://github.com/madhawa-gunasekara> >> >>> > > >>>> >> >>> > > >>>> Madhawa >> >>> > > >>>> >> >>> > > >>>> On Sun, Apr 24, 2016 at 12:13 AM, Madhawa Kasun Gunasekara < >> >>> > > >>>> madhaw...@gmail.com> wrote: >> >>> > > >>>> >> >>> > > >>>> > Hi Chris, >> >>> > > >>>> > >> >>> > > >>>> > I'm available on Tuesday & Wednesday after 6.00 pm IST. >> >>> > > >>>> > >> >>> > > >>>> > Thanks, >> >>> > > >>>> > Madhawa >> >>> > > >>>> > >> >>> > > >>>> > Madhawa >> >>> > > >>>> > >> >>> > > >>>> > On Sat, Apr 23, 2016 at 11:38 PM, Anastasija Mensikova < >> >>> > > >>>> > mensikova.anastas...@gmail.com> wrote: >> >>> > > >>>> > >> >>> > > >>>> >> Hi Chris, >> >>> > > >>>> >> >> >>> > > >>>> >> Thank you very much for your email. I'm so excited to work >> with >> >>> > > you! >> >>> > > >>>> >> >> >>> > > >>>> >> My Github name is amensiko. >> >>> > > >>>> >> >> >>> > > >>>> >> And yes, next week sounds good! I'm available on: Tuesday >> at >> >>> > 4:20pm >> >>> > > >>>> EST, >> >>> > > >>>> >> Thursday 11am - 2:30pm and 4:20 - 6pm EST, Friday 11am - >> 3pm >> >>> EST. >> >>> > > >>>> >> >> >>> > > >>>> >> Thank you, >> >>> > > >>>> >> Anastasija >> >>> > > >>>> >> >> >>> > > >>>> >> On 23 April 2016 at 10:21, Mattmann, Chris A (3980) < >> >>> > > >>>> >> chris.a.mattm...@jpl.nasa.gov> wrote: >> >>> > > >>>> >> >> >>> > > >>>> >>> Hi Anastasija, >> >>> > > >>>> >>> >> >>> > > >>>> >>> Hope you are well. It’s now time to get started on the >> >>> project. >> >>> > > >>>> >>> Monder, Anthony, Madhawa and I have been discussing ideas >> >>> about >> >>> > > >>>> >>> how to proceed with the project and even developing a task >> >>> list. >> >>> > > >>>> >>> Let’s get your tasks input into that list, and also >> >>> coordinate. >> >>> > > >>>> >>> >> >>> > > >>>> >>> I also have an action to share some Spanish/English data >> to >> >>> try >> >>> > > >>>> >>> and do cross lingual sentiment analysis. >> >>> > > >>>> >>> >> >>> > > >>>> >>> Are you available to chat this week? >> >>> > > >>>> >>> >> >>> > > >>>> >>> Cheers, >> >>> > > >>>> >>> Chris >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >>>> >>> Chris Mattmann, Ph.D. >> >>> > > >>>> >>> Chief Architect >> >>> > > >>>> >>> Instrument Software and Science Data Systems Section (398) >> >>> > > >>>> >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> >>> > > >>>> >>> Office: 168-519, Mailstop: 168-527 >> >>> > > >>>> >>> Email: chris.a.mattm...@nasa.gov >> >>> > > >>>> >>> WWW: >> >>> > > > >> >>> > > > >> >>> > > >>http://sunset.usc.edu/~mattmann/ < >> http://sunset.usc.edu/~mattmann/> >> >>> > > >>>> >>> >> >>> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >>>> >>> Director, Information Retrieval and Data Science Group >> (IRDS) >> >>> > > >>>> >>> Adjunct Associate Professor, Computer Science Department >> >>> > > >>>> >>> University of Southern California, Los Angeles, CA 90089 >> USA >> >>> > > >>>> >>> WWW: http://irds.usc.edu/ >> >>> > > >>>> >>> >> >>> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >>> On 4/23/16, 4:49 AM, "Anthony Beylerian" < >> >>> > > anthony.beyler...@gmail.com >> >>> > > >>>> > >> >>> > > >>>> >>> wrote: >> >>> > > >>>> >>> >> >>> > > >>>> >>> >Hello, >> >>> > > >>>> >>> > >> >>> > > >>>> >>> >Congratulations for being accepted for this year's GSoC. >> >>> > > >>>> >>> >Although Mondher and myself will not participate this >> year as >> >>> > > >>>> students, >> >>> > > >>>> >>> we >> >>> > > >>>> >>> >will do our best to help. >> >>> > > >>>> >>> >We are currently busy with academic research, but will >> join >> >>> the >> >>> > > >>>> efforts >> >>> > > >>>> >>> >when possible. >> >>> > > >>>> >>> >Otherwise, for any discussion concerning the proposed >> >>> > approaches, >> >>> > > >>>> please >> >>> > > >>>> >>> >let us know. >> >>> > > >>>> >>> > >> >>> > > >>>> >>> >Best, >> >>> > > >>>> >>> > >> >>> > > >>>> >>> >On Sat, Apr 23, 2016 at 6:02 PM, Madhawa Kasun >> Gunasekara < >> >>> > > >>>> >>> >madhaw...@gmail.com> wrote: >> >>> > > >>>> >>> > >> >>> > > >>>> >>> >> Sure we will start working on this. >> >>> > > >>>> >>> >> >> >>> > > >>>> >>> >> Thanks, >> >>> > > >>>> >>> >> Madhawa >> >>> > > >>>> >>> >> >> >>> > > >>>> >>> >> Madhawa >> >>> > > >>>> >>> >> >> >>> > > >>>> >>> >> On Sat, Apr 23, 2016 at 1:38 AM, Chris Mattmann < >> >>> > > >>>> mattm...@apache.org> >> >>> > > >>>> >>> >> wrote: >> >>> > > >>>> >>> >> >> >>> > > >>>> >>> >>> Congrats! >> >>> > > >>>> >>> >>> >> >>> > > >>>> >>> >>> time to get started team. >> >>> > > >>>> >>> >>> >> >>> > > >>>> >>> >> >>> > > >>>> >> >> >>> > > >>>> >> >> >>> > > >>>> > >> >>> > > >>>> >> >>> > > >>> >> >>> > > >>> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > >> >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > > >> >>> > > >> >>> > >> >>> >>