Hello Vlastimil,

Unfortunately, the size of DSpace sites is very difficult to track overall (it 
relies entirely on self reporting).

I know there are very large sites out there... a few that come to mind are U of 
Cambridge 
(https://www.repository.cam.ac.uk<https://www.repository.cam.ac.uk/>), and 
Georgetown University (https://repository.library.georgetown.edu/).  I cannot 
claim to know exactly how large the sites are though, as each of these sites 
may have access restricted content (which is not even visible on the web).  
However, in terms of public content alone each has 250-350 thousand items.

I also admit that I don't know whether there are larger sites out there.  But, 
maybe institutions on this mailing list will self-report if they have more than 
400 thousand items. (I know I'd love to hear which sites have >400K items!)

I think Mark Wood gave a thorough answer regarding the number of items possible 
in a DSpace.  Technically, the biggest limitation is the amount of server space 
& memory available (as larger sites need more of each).  For each release we 
attempt to make DSpace as performant (and memory lean) as we can, and as memory 
issues are reported we resolve them as bugs in a new release.  For example, for 
the upcoming DSpace 7 release (which is still under active development) we are 
running more detailed performance testing as detailed here: 
https://wiki.duraspace.org/display/DSPACE/DSpace+7+Performance+Testing   At 
this time, that performance testing is more geared towards minimizing CPU load 
and memory overall (which will also help in scaling).

Tim

________________________________
From: [email protected] <[email protected]> on 
behalf of Vlastimil Krejčíř <[email protected]>
Sent: Friday, August 23, 2019 5:57 AM
To: DSpace Community <[email protected]>
Subject: [dspace-community] Scalability of DSpace

Hi all,

back in April 2013 I asked the community about the DSpace scalability, see:

http://dspace.2283337.n4.nabble.com/DSpace-scalability-tens-of-hundreds-TBs-tt4662988.html#a4663047

Now, at 2019, it is time to ask the same question :-).

How much data / how many items can DSpace handle? The DSpace system at 
Cambridge University (https://www.repository.cam.ac.uk/) was reported as the 
largest then. I can see it stores about 245 thousands of items nowadays.

Does anyone else have bigger one? Are there new information on scalability 
since 2013?

Regards,

Vlastik Krejčíř

--
----------------------------------------------------------------------------
Vlastimil Krejčíř
Library and Information Centre, Institute of Computer Science
Masaryk University, Brno, Czech Republic
Email: krejcir (at) ics (dot) muni (dot) cz
Phone: +420 549 49 3872
OpenPGP key: https://kic-internal.ics.muni.cz/~krejvl/pgp/
Fingerprint: 7800 64B2 6E20 645B 56AF  C303 34CB 1495 C641 11B9
----------------------------------------------------------------------------


--
All messages to this mailing list should adhere to the DuraSpace Code of 
Conduct: https://duraspace.org/about/policies/code-of-conduct/
---
You received this message because you are subscribed to the Google Groups 
"DSpace Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to 
[email protected]<mailto:[email protected]>.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/dspace-community/a37b7af1-59eb-4a7e-b302-196cadbed7a0%40googlegroups.com<https://groups.google.com/d/msgid/dspace-community/a37b7af1-59eb-4a7e-b302-196cadbed7a0%40googlegroups.com?utm_medium=email&utm_source=footer>.

-- 
All messages to this mailing list should adhere to the DuraSpace Code of 
Conduct: https://duraspace.org/about/policies/code-of-conduct/
--- 
You received this message because you are subscribed to the Google Groups 
"DSpace Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/dspace-community/DM5PR22MB05727332D082F1B9BEB443BCEDA40%40DM5PR22MB0572.namprd22.prod.outlook.com.

Reply via email to