Hi Adam, Thanks for the response. From your info it sounds like a shouldn't have too much of a problem. Some of our video ingests will need to happen through some kind of web interface, so I was thinking ahead to user experience, and whether or not I would have to build-in any progress screens while a checksum was created. Looks like I won't have to worry.
It'd still be a good idea to build some kind of progress interface - if only because the API request for the ingest might time out before the ingest is completed. I've been meaning to do some more testing in this area but haven't yet done so, so once you get to that stage, anything you can share would be useful. Our really large video files will most likely be ingested via scripts or a some kind of network drop box, so timing isn't so much of an issue there. That sounds reasonable. We've experimented with encoding our video straight into a WebDAV share, which Fedora can access. That way, we can build FoXMLs with '<foxml:contentLocation>' tags pointing to the HTTP URLs of the encoded items and Fedora will download them, rather than relying on POSTing large files through the REST or API-M interfaces. An alternative is the file:// URI schema, which could be faster if the files are already locally addressable. The other bug which you may run into on 3.3 is FCREPO-696 [1] - also related to checksums. This is fixed in 3.4. Also, the Flex client has an upload bug relating to managed content datastreams [2], though this doesn't affect the API. Regards, Graeme [1] https://jira.duraspace.org/browse/FCREPO-696 [2] https://jira.duraspace.org/browse/FCREPO-605 On 25 Oct 2010, at 17:48, Adam Wead wrote: Hi Graeme, Thanks for the response. From your info it sounds like a shouldn't have too much of a problem. Some of our video ingests will need to happen through some kind of web interface, so I was thinking ahead to user experience, and whether or not I would have to build-in any progress screens while a checksum was created. Looks like I won't have to worry. Our really large video files will most likely be ingested via scripts or a some kind of network drop box, so timing isn't so much of an issue there. I'm also currently developing with version 3.3 and will soon move to 3.4 to avoid the bugs you mentioned. thanks again! ...adam On Mon, Oct 25, 2010 at 4:59 AM, West, Graeme <graeme.w...@gcu.ac.uk<mailto:graeme.w...@gcu.ac.uk>> wrote: Hi Adam, I can't answer the question regarding threading in any technical detail, but I gather that in general Fedora is multi-threaded, so multiple ingest requests will be handled separately. I think it's the case though that when ingesting multiple objects simultaneously, the shell utilities provided with Fedora (e.g. passing a 'd' flag to fedora-ingest.sh ) work sequentially - so you may want to make API requests directly rather than using these if you want to do things in parallel. We're using Fedora for storage of video files - mostly compressed MPEG-2, so in the 800MB - 4GB file size range. We use SHA-512 checksums without any issues. I generally monitor ingests through a combination of Java VisualVM and iotop to get an impression of what's IO-centric and what's CPU-centric: on ingest of 2 - 4GB files, checksumming is pretty quick - 1 to 5 seconds at the end of an ingest cycle. That's on a Xen VM running on a dual-quad core Xeon physical machine. Just to note, Fedora 3.4 fixes a bug in the REST interface in relation to files over 2GB [1]. Also, versions prior to 3.2 (I think) will generally attempt to read entire datastreams into RAM on ingest, which isn't a great idea with large files. There is also an outstanding JIRA issue relating to streaming files directly into the back-end filestore, which if implemented would speed up large ingests considerably [2]. We have tested 3.4 ingesting up to 45GB files with no particular performance issues, and these ingests included checksumming. Regards, Graeme [1] https://jira.duraspace.org/browse/FCREPO-704 [2] https://jira.duraspace.org/browse/FCREPO-495 On 22 Oct 2010, at 19:02, Adam Wead wrote: Hi all, How does the checksum generation process for very large files, like video files of 50 GB, affect Fedora's performance? The scenario I'm thinking about is if you're ingesting a batch of large files into Fedora, will it fork a new process to generate the checksum in the background while proceeding on to the next file, or will it only proceed to the next file after the checksum for the first file has been generated. Anyone care to chime in on checksums in general for video files? I've used them before on audio files without too much trouble, but the size of video files might present problems. thanks, ...adam Email has been scanned for viruses by Altman Technologies' email management service<http://www.altman.co.uk/emailsystems> ------------------------------------------------------------------------------ Nokia and AT&T present the 2010 Calling All Innovators-North America contest Create new apps & games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems<http://www.altman.co.uk/emailsystems> _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net<mailto:Fedora-commons-users@lists.sourceforge.net> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems<http://www.altman.co.uk/emailsystems> Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education's Widening Participation Initiative of the Year 2009 and Herald Society's Education Initiative of the Year 2009 http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html ------------------------------------------------------------------------------ Nokia and AT&T present the 2010 Calling All Innovators-North America contest Create new apps & games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net<mailto:Fedora-commons-users@lists.sourceforge.net> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users -- ~/amsterdam Email has been scanned for viruses by Altman Technologies' email management service<http://www.altman.co.uk/emailsystems> ------------------------------------------------------------------------------ Nokia and AT&T present the 2010 Calling All Innovators-North America contest Create new apps & games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/fedora-commons-users Email has been scanned for viruses by Altman Technologies' email management service - www.altman.co.uk/emailsystems Glasgow Caledonian University is a registered Scottish charity, number SC021474 Winner: Times Higher Education's Widening Participation Initiative of the Year 2009 and Herald Society's Education Initiative of the Year 2009 http://www.gcu.ac.uk/newsevents/news/bycategory/theuniversity/1/name,6219,en.html ------------------------------------------------------------------------------ Nokia and AT&T present the 2010 Calling All Innovators-North America contest Create new apps & games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/fedora-commons-users