PERFECT ++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Adjunct Associate Professor, Computer Science Department University of Southern California Los Angeles, CA 90089 USA Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++
-----Original Message----- From: Jarin Nitin Shah <[email protected]> Date: Monday, October 20, 2014 at 10:27 PM To: Chris Mattmann <[email protected]> Subject: Re: 572-Homework 2 Doubt >Hello Professor, > > >I have created a new page >https://cwiki.apache.org/confluence/display/OODT/Metadata+Extractors and >added > that link in the column File Manager / Crawler / PushPull of Catalog and >Archive section. >I hope this is correct :) > > >Thank You > > >On Mon, Oct 20, 2014 at 9:52 PM, Christian Alan Mattmann ><[email protected]> wrote: > >Thanks Jarin. No, but let me be more specific: :) > >Please remove that link on the main OODT page. >Note there is a section there Called “Catalog and Archive”. >If you click it, you will see: > >https://cwiki.apache.org/confluence/display/OODT/Catalog+and+Archive > >On _that_ page, > >1. in the column for File Manager / Crawler / PushPull, please add >a link to a page: >https://cwiki.apache.org/confluence/display/OODT/Metadata+Extractors > >(you will create this page) > >2. Create a new page: > >https://cwiki.apache.org/confluence/display/OODT/Metadata+Extractors > > >Copy the contents of the static web page that you link to below to >the wiki page above. > >Thanks and let me know if that is clear. > > >++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >Chris Mattmann, Ph.D. >Adjunct Associate Professor, Computer Science Department >University of Southern California >Los Angeles, CA 90089 USA >Email: [email protected] >WWW: http://sunset.usc.edu/~mattmann/ >++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > > >-----Original Message----- >From: Jarin Nitin Shah <[email protected]> >Date: Monday, October 20, 2014 at 9:33 PM >To: Chris Mattmann <[email protected]> >Subject: Re: 572-Homework 2 Doubt > >>Hello Professor, >> >> >>I have created a sub-page linked to the OODT home page that displays the >>content of Metadata Extractors. Is this correct? >> >> >>Thank You >> >> >>On Mon, Oct 20, 2014 at 9:18 PM, Christian Alan Mattmann >><[email protected]> wrote: >> >>Thanks - when I reviewed your last edit, it seemed like you >>copied it to the main OODT wiki page. You should create a >>sub-page (linked from the home page), that contains the >>content (it shouldn’t be directly on the OODT wiki home >>page). Does that make sense? >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>Chris Mattmann, Ph.D. >>Adjunct Associate Professor, Computer Science Department >>University of Southern California >>Los Angeles, CA 90089 USA >>Email: [email protected] >>WWW: http://sunset.usc.edu/~mattmann/ >>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >> >> >> >> >>-----Original Message----- >>From: Jarin Nitin Shah <[email protected]> >>Date: Monday, October 20, 2014 at 9:16 PM >>To: Chris Mattmann <[email protected]> >>Subject: Re: 572-Homework 2 Doubt >> >>>Hello Professor, >>> >>> >>>I have copied the contents of that page >>>http://oodt.apache.org/components/maven/metadata/user/basic.html#extract >>>o >>>r >>>s to the OODT wiki. >>> >>> >>>Thank You >>> >>> >>>On Mon, Oct 20, 2014 at 8:42 PM, Christian Alan Mattmann >>><[email protected]> wrote: >>> >>>Thanks! It would be great to get the contents of that page >>>copied to the wiki so folks could edit it directly there. >>> >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>Chris Mattmann, Ph.D. >>>Adjunct Associate Professor, Computer Science Department >>>University of Southern California >>>Los Angeles, CA 90089 USA >>>Email: [email protected] >>>WWW: http://sunset.usc.edu/~mattmann/ >>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>> >>> >>> >>> >>>-----Original Message----- >>>From: Jarin Nitin Shah <[email protected]> >>>Date: Monday, October 20, 2014 at 8:37 PM >>>To: Chris Mattmann <[email protected]> >>>Subject: Re: 572-Homework 2 Doubt >>> >>>>Hello Professor, >>>> >>>> >>>>I have added the link Metadata Extractors( CAS Metadata Project) under >>>>the File Manager / Crawler / PushPull list of >>>>Catalog and Archive section of OODT wiki page. >>>>It directly links to >>>>http://oodt.apache.org/components/maven/metadata/user/basic.html#extrac >>>>t >>>>o >>>>r >>>>s. >>>> >>>> >>>>Thank You >>>> >>>> >>>>On Mon, Oct 20, 2014 at 8:04 PM, Christian Alan Mattmann >>>><[email protected]> wrote: >>>> >>>>Perms granted! >>>> >>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>Chris Mattmann, Ph.D. >>>>Adjunct Associate Professor, Computer Science Department >>>>University of Southern California >>>>Los Angeles, CA 90089 USA >>>>Email: [email protected] >>>>WWW: http://sunset.usc.edu/~mattmann/ >>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>> >>>> >>>> >>>> >>>>-----Original Message----- >>>>From: Jarin Nitin Shah <[email protected]> >>>>Date: Monday, October 20, 2014 at 7:56 PM >>>>To: Chris Mattmann <[email protected]> >>>>Subject: Re: 572-Homework 2 Doubt >>>> >>>>>Hello Professor, >>>>> >>>>> >>>>>Following are our usernames: >>>>>1. apraj >>>>>2. hemnani >>>>>3. jarinshah >>>>> >>>>> >>>>>I will let you know once I have made the changes. >>>>>Thank You >>>>> >>>>>On Mon, Oct 20, 2014 at 7:42 PM, Christian Alan Mattmann >>>>><[email protected]> wrote: >>>>> >>>>>Sure, please provide your usernames and I will add your >>>>>name on the permissions. >>>>> >>>>>Thanks and let me know. >>>>> >>>>>Cheers, >>>>>Chris >>>>> >>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>>Chris Mattmann, Ph.D. >>>>>Adjunct Associate Professor, Computer Science Department >>>>>University of Southern California >>>>>Los Angeles, CA 90089 USA >>>>>Email: [email protected] >>>>>WWW: http://sunset.usc.edu/~mattmann/ >>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>> >>>>> >>>>> >>>>> >>>>>-----Original Message----- >>>>>From: Jarin Nitin Shah <[email protected]> >>>>>Date: Monday, October 20, 2014 at 1:21 PM >>>>>To: Chris Mattmann <[email protected]> >>>>>Subject: Re: 572-Homework 2 Doubt >>>>> >>>>>>Hello Professor, >>>>>> >>>>>> >>>>>>We are thinking of porting this link >>>>>>http://oodt.apache.org/components/maven/metadata/user/basic.html#extr >>>>>>a >>>>>>c >>>>>>t >>>>>>o >>>>>>r >>>>>>sto the Calalog and Archive section of the OODT wiki page but it >>>>>>looks >>>>>>like we don't have permission to edit it. >>>>>>Can you please give us any details regarding editing the wiki page. >>>>>> >>>>>> >>>>>>Thank You >>>>>> >>>>>> >>>>>> >>>>>>On Sun, Oct 19, 2014 at 9:00 PM, Christian Alan Mattmann >>>>>><[email protected]> wrote: >>>>>> >>>>>>Great progress even getting this far, Jarin. You will need to use >>>>>>one of Apache OODT¹s met extractors. Note that they are present here: >>>>>> >>>>>>http://oodt.apache.org/components/maven/metadata/user/basic.html >>>>>> >>>>>>It would be great also if you could port some of this documentation >>>>>>to the OODT wiki: >>>>>> >>>>>>https://cwiki.apache.org/confluence/display/OODT/Home >>>>>> >>>>>> >>>>>>You should be able to leverage one of the default extractors or >>>>>>write your own. >>>>>> >>>>>>HTH! >>>>>> >>>>>>Cheers, >>>>>>Chris >>>>>> >>>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>>>Chris Mattmann, Ph.D. >>>>>>Adjunct Associate Professor, Computer Science Department >>>>>>University of Southern California >>>>>>Los Angeles, CA 90089 USA >>>>>>Email: [email protected] >>>>>>WWW: http://sunset.usc.edu/~mattmann/ >>>>>>++++++++++++++++++++++++++++++++++++++++++++++++++++++++ >>>>>> >>>>>> >>>>>> >>>>>> >>>>>>-----Original Message----- >>>>>>From: Jarin Nitin Shah <[email protected]> >>>>>>Date: Sunday, October 19, 2014 at 8:54 PM >>>>>>To: Chris Mattmann <[email protected]> >>>>>>Subject: 572-Homework 2 Doubt >>>>>> >>>>>>>Hello Professor, >>>>>>> >>>>>>> >>>>>>>In Homework 2 pdf, you have written the following under Downloading >>>>>>>Apache OODT section: >>>>>>>You will be responsible for creating: >>>>>>>1. Product type information for your JSON file >>>>>>>2. Capturing basic file metadata about your JSON files that you >>>>>>>ingest >>>>>>>into Solr using ETLLib >>>>>>> >>>>>>> >>>>>>>But we couldn't find a program in ETLlib that provides basic file >>>>>>>metadata about the JSON files. >>>>>>>Are we supposed to use MimeTypeExtractor, CharsetExtractor from OODT >>>>>>>or >>>>>>>write a .met file manually for JSON? Or is there a program in Etllib >>>>>>>that >>>>>>>extracts metdata out of the box? >>>>>>> >>>>>>> >>>>>>>Appreciate your help. >>>>>>> >>>>>>> >>>>>>>-- >>>>>>>Thanks & Regards, >>>>>>> >>>>>>>Jarin Nitin Shah >>>>>>>Graduate Student at USC, >>>>>>>MS in Computer Science, >>>>>>>[email protected] >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> >>>>>>-- >>>>>>Thanks & Regards, >>>>>> >>>>>>Jarin Nitin Shah >>>>>>Graduate Student at USC, >>>>>>MS in Computer Science, >>>>>>[email protected] >>>>>> >>>>>> >>>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>>-- >>>>>Thanks & Regards, >>>>> >>>>>Jarin Nitin Shah >>>>>Graduate Student at USC, >>>>>MS in Computer Science, >>>>>[email protected] >>>>> >>>>> >>>>> >>>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>>-- >>>>Thanks & Regards, >>>> >>>>Jarin Nitin Shah >>>>Graduate Student at USC, >>>>MS in Computer Science, >>>>[email protected] >>>> >>>> >>>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>> >>>-- >>>Thanks & Regards, >>> >>>Jarin Nitin Shah >>>Graduate Student at USC, >>>MS in Computer Science, >>>[email protected] >>> >>> >>> >> >> >> >> >> >> >> >> >> >> >>-- >>Thanks & Regards, >> >>Jarin Nitin Shah >>Graduate Student at USC, >>MS in Computer Science, >>[email protected] >> >> >> > > > > > > > > > > >-- >Thanks & Regards, > >Jarin Nitin Shah >Graduate Student at USC, >MS in Computer Science, >[email protected] > > >
