sorry, meant to send this to the group.
Sorry you're getting it twice, Laurent.




----- Forwarded Message ----
From: don undeen <donund...@yahoo.com>
To: Laurent Alquier <laur...@alquier.org>
Sent: Thu, November 4, 2010 1:51:49 PM
Subject: Re: [SMW-devel] Interest in SMW in Museum/Cultural sector


Hi Laurent,
Sounds like you are doing work very similar to me. I'm also spending quite a 
bit 
of time on customization, upgrades, bugtracking, etc.
Our museum has in the range of 500,000 art objects. To say nothing of the 
number 
of artists, donors, materials used, locations, as well as all the data from 
OTHER museums that would be involved in data integration. PLUS all the 
relations 
between them. 

I'm obviously nowhere near using all of those, and I split my wikis into 
multiple, purpose-oriented wikis. I've got wikis with 800, 1.5k, 26k, and 47k 
pages, and that's not even close to where I need to be. Currently running on 
VMs 
with 4G of ram. On those larger wikis, the queries don't run so well, but at 
least I can view pages.
So when I talk about scaling, that's what I'm dealing with.  potentially on the 
order of millions of pages. Or, at least I need to know how high I can 
reasonably scale, and what hardwares involved.
If I knew that we could spend $X, and then actually be able to fully scale to 
millions of pages, with full querying capabilities, I could probably get that 
money, almost no matter how big X was. This institution finds plenty of money 
for less worthy projects.

But, if the performance degrades geometrically, then there's a hard limit to 
what we can do with any amount of money. 


My problem is, I got my current servers for asking nicely, because they're VMs 
partitioned from a large resource. Without some better information on scaling, 
it's hard for me to get a real budget allocated to experiment.

In any case, I'm getting 3 more VMs shortly (probably need to buy the admin a 
six-pack of Chimay Blue), at which point I can play a bit more with 
configuration and performance. I'm NO sysadmin of any sort,  just now learning 
how to tweak the mysql server.

Which bring me to a question:. say I've got two wikis of equal size, and two 
VMs, on the same physical hardware. Is it better to put both wikis on one 
machine, and both dbs on another, or to put each wiki with its db on the same 
server? I think I just revealed the depth of my ignorance here, but oh well...

Any other easy tweaking advice would be really appreciated. I've played a bit 
with the index buffer size on mysql. Any other ideas?

Don





________________________________
From: Laurent Alquier <laur...@alquier.org>
To: don undeen <donund...@yahoo.com>
Sent: Thu, November 4, 2010 12:47:04 PM
Subject: Re: [SMW-devel] Interest in SMW in Museum/Cultural sector

Hi Don

I have been experimenting a lot with using SMW for Enterprise Data Integration 
in a production environment.

The good news is, like you, I am a convert to how flexible SMW is for data 
integration. At the moment, I am slowly pushing it for 1/ an enterprise 
semantic 
wikipedia and 2/ a gateway to author Linked Data without having to know 
anything 
about RDF and SPARQL (at least, as far as authors of content are concerned).

I have not yet reached the breaking point of SMW on our setup. We are using a 5 
years old server, with a load of about 50 unique visits a day, 4500 pages and 
about 25 active contributors. That's also good news - no need for top of the 
line servers to get a decent amount of work done.

The bad news is that while SMW is indeed very cheap, it comes at the hidden 
cost 
of the need for an almost dedicated resources to tune it, improve it and make 
it 
successful. I am glad that I can patch the code and mold it to the way I want 
it 
to work, but it still requires a lot of attention.

We are using highly customized skins, custom code to improve on existing 
extensions or to simply make them work in our environment. The result is a very 
streamlined experience for users but a lot of work behind the scene as we are 
making it grow.

Regards
- Laurent


On Thu, Nov 4, 2010 at 12:21 PM, don undeen <donund...@yahoo.com> wrote:

Hi all,
>I just thought I'd give you an update on some presentations I did this past 
>week 
>in the museum/cultural sector on SMW, and the excellent response I've been 
>receiving there.
>
>The first talk I gave was as the Museum Computer Network Conference, MCN2010. 
>The title of the talk was "Semantic Mediawiki for Easy Data Integration." My 
>general thesis was that a combination of templates, forms, auto-page-creating 
>properties, and the ExternalData extension, can allow us to integrate data 
>from 
>a variety of sources, without doing any up-front database design or data 
>mapping. The talk was short (5 minutes), so I don't have many slides. But 
>after 
>the talk was a table session where I answered questions from those in the 
>audience who were interested (they had 4 topics to choose from). My table  was 
>by far the most populated, i'd say about 30 people from museums and cultural 
>institutions around the world were very interested, and engaged me with 
>challenging questions regarding this approach. Museums have a big job trying 
>to 
>communicate information with each other regarding their objects; a typical 
>project is very task-specific (say, a particular website or research project), 
>and involves massive data transfer, as well as data standards negotiation, 
>target database design, data mapping, and vendor relationships. Lots of 
>museums 
>don't have the resources to do it at all. They really liked the idea that a 
>SMW 
>installation could be a low barrier-to-entry "sandbox" where they could put 
>all 
>this data together and see how well it integrates. 
>
>I didn't try to say "this is your data integration app," but rather "this is a 
>tool that let's you SEE the data AS you're doing the planning."
>The reason I played down the enterprise-level quality of SMW is  that frankly 
>I 
>don't have good data on scalability. Museums have a LOT of data, and I don't 
>have the resources here to really scale out my wiki that large. Plus I haven't 
>resolved some bugs, usability, and UI issues; that may be my fault (not 
>keeping 
>up with all the upgrades), or rough edges in SMW.
>
>Just letting you all know that the interest is there in this industry, for 
>this 
>problem space. If someone can demonstrate scalability and a streamlines user 
>experience, they could really do good work with museums. 
>
>
>I'll be putting together a more detailed document on how I set all this up.
>
>
>Also, I went to a smaller meeting in Washington DC, to discuss the development 
>of a documentation management system for Museum Conservators. Another 
>international group, with funding for development. In my own museum, I've been 
>using SMW for rapid prototyping and data staging. I'll be using it in this 
>project for data staging; but I don't feel that  the SMW is flexible enough in 
>the UI to use it for app prototyping for this project. Also there are a few 
>conflicts between this software's projected workflow, and what SMW supports. 
>Again, maybe this will turn out not to be true. But in any case, I'll be 
>putting 
>the SMW software in front of these heavy-duty knowledge workers (conservators 
>are actually scientists with heavy data analysis needs). I'll let you know 
>what 
>kind of feedback I get from them.
>
>Again, I'll be putting together a doc (a different one, focused on 
>prototyping)  
>that describes this software setup in more detail.
>
>anyways, just thought I'd let you know that there's now this new group of 
>people 
>out there who are using the term "Semantic Wiki" with a gleam in their eyes. 
>:) 
>. And hopefully my institution will be allocating more of my time to work with 
>SMW.
>
>Let me know if you've got any questions.
>
>cheers all, and thanks for making me look good!
>
>Don  Undeen
>Metropolitan Museum of Art
>
>------------------------------------------------------------------------------
>The Next 800 Companies to Lead America's Growth: New Video Whitepaper
>David G. Thomson, author of the best-selling book "Blueprint to a
>Billion" shares his insights and actions to help propel your
>business during the next growth cycle. Listen Now!
>http://p.sf.net/sfu/SAP-dev2dev
>_______________________________________________
>Semediawiki-devel mailing list
>Semediawiki-devel@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/semediawiki-devel
>
>


-- 
- Laurent Alquier
http://www.linfa.net
------------------------------------------------------------------------------
The Next 800 Companies to Lead America's Growth: New Video Whitepaper
David G. Thomson, author of the best-selling book "Blueprint to a 
Billion" shares his insights and actions to help propel your 
business during the next growth cycle. Listen Now!
http://p.sf.net/sfu/SAP-dev2dev
_______________________________________________
Semediawiki-devel mailing list
Semediawiki-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/semediawiki-devel

Reply via email to