For DSpace disk space allocation, you will need to consider the following: 1) size of your content 2) size of extracted text from your content 3) size of metadata 4) size of the software itself and its dependencies 5) size of the logs 6) size of usage events (for statistics)
In practice, you only have to think about 1), 2), 5) and 6), the rest is fairly small (the rest combined will likely fit in 1 GB for a mid-sized repository of ~100,000 items). 1) This is self-explanatory. DSpace will need as much disk space as the size of the contents you want to store in it. No more, no less. 2) If you decide to run filter-media to extract full-text from your content (PDFs, docs and some others), you will need to allocate space for the extracted text files, which will be smaller than the source content files. Count this text content twice because it will be stored as text files in the assetstore and as part of the documents in the Solr search core. 5) Logging is enabled by default and the logs can grow by dozens of gigabytes per year for a mid-sized repo, depending on how much it's accessed. You'll want to archive log off-site or delete old logs. 6) Usage events will be recorded in the Solr statistics core which you'll probably want to keep and will need ~1 GB/year for a mid-sized repo. DSpace doesn't do anything special for video or streaming, so there's no need to allocate disk space for that, unless you decide to add such functionality to DSpace yourself. Regards, ~~helix84 Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- You received this message because you are subscribed to the Google Groups "DSpace Technical Support" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/dspace-tech. For more options, visit https://groups.google.com/d/optout.
