Hi Scott, I don't have numbers for you at this point, but we've been using ActiveMQ as a queueing / message broker to loosely couple systems that participate, along with Fedora, in various collection management activity. We're planning to ingest a collection of 50k images and can report on how that goes by the end of the year, probably in November.
Here's a longer description of broader context and what we've done so far in case of interest. I'd be very interested in comments or similar experience you or others have. The general idea is to let various systems report significant events via JMS with the event handling to be loosely coupled (some operations take a while), extensible and flexibly/centrally managed externally to the systems. The motivation is that we have various systems contributing repository content (Drupal, Archon, and Voyager for starters), we need to synchronize metadata updates (descriptive metadata and also authority references) and want to minimize the system-specific customization necessary for systems to communicate/ participate. For example, we have started to use Drupal for creating repository content, using a CCK content type to enter/edit media file paths and repeating Dublin Core fields. We use a modified version of the MsgQueue Drupal module to queue insert/update/delete node_api events to ActiveMQ for Drupal configured CCK types of interest. Currently the message includes the node and CCK data, as the initial content types are simple and relatively small (Dublin Core). The JMS handler for these events invokes ingest scripts on a conversion/ingest- preprocessing server that obtain the media (images currently), JHove validate it, extract technical metadata, convert to jp2, push the jp2 to image server, push the source file to archive server, generate foxml with datastreams (DC, MODS, SVG) and ingest into Fedora. In this scenario Drupal doesn't have to know about Fedora or how to invoke the various ingest preprocessing. It just reports events of interest. We plan to expand event handling to include solr indexing and preservation metadata capture. We also use the Drupal Node Import module to trigger such ingests from a spreadsheet. We're thinking about modifying this module to take fields from an ODBC/JDBC source and from file systems structures (directories of images for example). We're planning to test the ingest of a collection of 50K images in this way. We can report on how that goes. We have a slightly different pattern for EAD objects created/edited in Archon. Since finding aid EAD data is more complex than DC, in this case Archon will just send a message to the JMS broker that an Archon collection should be synchronized to Fedora. The JMS handler will turn around and make a request to Archon for the collection as an EAD export and ingest/update appropriately into Fedora. Haven't dine this yet, but it's due by the end of this week.:) It's almost there. We've been talking about synchronizing authority data similarly, for example when we get updates from OCLC for our local name authority file in Voyager that should propagate to affected records in Archon, Drupal, and Fedora. We're still talking this through, but it's part of the motivation to handle synchronization processes in a JMS message broker. I'd be very interested in comments, anecdotes, benchmarks, suggestions in this approach. Thanks, Bill On Oct 27, 2009, at 8:09 AM, Scott Prater wrote: > We'll soon be activating JMS in our Fedora installation, in order to > update our Solr index and memory cache on purge/ingest/update events. > Has anyone had any other experience using a third-party message > broker? > In particular, does anyone have any experience using a message broker > in situations of high traffic or with large datasets or heavy batch > operations? Any recommendations or gotchas to look out for? > > TIA, > > -- Scott > > aj...@virginia.edu wrote: >> I may have missed part of the conversation where you describe this, >> but are you using the JMS gear that comes with Fedora or a separate >> JMS broker (e.g. ActiveMQ)? >> >> --- >> A. Soroka >> Digital Research and Scholarship R & D >> the University of Virginia Library >> >> >> >> On Oct 26, 2009, at 6:56 PM, <carsten.friedr...@csiro.au> wrote: >> >>> · Using the message queue with fedoragsearch to update SOLR >>> seems not to be a good idea for larger datasets. I suspect using the >>> message queue for anything with a large datasets is not a good idea. >>> It seems to consume a lot of resources when running, does not seem >>> to shut down reliably (resulting in very long rebuild time next time >>> you start tomcat) and seems to scale very badly (performance goes >>> down significantly the more objects we add). There are a lot of >>> “seems” in this statement because we didn’t actually measure this >>> and it is only based on our subjective observation. >> >> >> ------------------------------------------------------------------------------ >> Come build with us! The BlackBerry(R) Developer Conference in SF, CA >> is the only developer event you need to attend this year. Jumpstart >> your >> developing skills, take BlackBerry mobile applications to market >> and stay >> ahead of the curve. Join us from November 9 - 12, 2009. Register now! >> http://p.sf.net/sfu/devconference >> _______________________________________________ >> Fedora-commons-users mailing list >> Fedora-commons-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users > > > -- > Scott Prater > Library, Instructional, and Research Applications (LIRA) > Division of Information Technology (DoIT) > University of Wisconsin - Madison > pra...@wisc.edu > > > ------------------------------------------------------------------------------ > Come build with us! The BlackBerry(R) Developer Conference in SF, CA > is the only developer event you need to attend this year. Jumpstart > your > developing skills, take BlackBerry mobile applications to market and > stay > ahead of the curve. Join us from November 9 - 12, 2009. Register now! > http://p.sf.net/sfu/devconference > _______________________________________________ > Fedora-commons-users mailing list > Fedora-commons-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/fedora-commons-users > ------------------------------------------------------------------------------ Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/fedora-commons-users