I have just come across Perkeep and haven't downloaded anything yet, as I'm still trying to figure out how it fits into what I'm looking for. Note that I don't require a complete solution; I am just looking to understand which part(s) Perkeep can manage for me. At this stage I think the only part is that of Aim 6, but I could be wrong...
My current set-up... I am a home user running a Windows Server 2012 R2 machine with close to 12TB of personal data that I need to deduplicate, organise, and gain remote access to. I also have some data on Dropbox and Mega that I need to consolidate with the rest. Some of the data across all of these systems is currently in a mess, with exact duplicates, duplicates that have been renamed, and variations of files (edited versions, etc.).

My long-term aims are as follows...

Aim 1: Catalogue/preprocess all files to generate a database of paths, file names, hashes, time-stamps, and other metadata. When a high-level metadata attribute cannot be found, the process could be scripted to fall back on the next best thing: for example, if the EXIF timestamp is missing, try to derive the timestamp from the file name and, if that fails, use the file-created time-stamp. The process should also allow for user input and user-assigned metadata and tags.

Aim 2: Identify duplicate file sets in the database using the preprocessed information, allowing for user input and user-assigned metadata and tags to aid the process, and then record this information in the database.

Aim 3: Using file contents, metadata, and tags, identify version sets of files and record this information in the database. Parts of this identification can be automated (for example, an image can be detected programmatically as having been rotated and/or lossily compressed), but in some cases it would fall back on manual identification, allowing for user input and user-assigned metadata and tags.
Aim 4: Organise files based on the database, deduplicating them in the process, i.e., physically and safely move and/or delete files into folders and file names based on the original file path(s), file name(s), metadata, and tags, whilst allowing for user input to manage and prevent file clashes so the process can be executed without any issues. This process would also write tags to file formats that support such data, and it could provide the option of sidecar metadata files and/or individual hash-tag files carrying the tag names. This is how I currently add cross-platform tag support at the file-system level: I create individual empty (null-byte) files named #<tag-name> alongside the original file. I do this for video files, each of which sits in its own folder to keep the tags grouped, but the tags could just as well be grouped by prefixing the original file name to the hash-tag file name, i.e., <file-name>#<tag-name>.

Aim 5: Gain cross-platform access across the Internet to all organised files, preferably with user-based access permissions.

Yes, I realise this is way more than what Perkeep was most likely designed to do, but I'm interested not only in which part(s) Perkeep can solve but also in any ideas and input people may have on solving some of these tasks.

Cheers,
-Andreas
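
P.S. To make Aims 1 and 2 a bit more concrete, here is a rough Python sketch of the kind of thing I have in mind. It is only an illustration, not a finished tool: all function names are made up, the date pattern for file names is an assumption, the EXIF value is expected to be supplied by some other step (no EXIF library is used here), and the "file created" time relies on st_ctime, which is the creation time on Windows but not on most Unix systems.

```python
import hashlib
import os
import re
from collections import defaultdict
from datetime import datetime

# Assumed pattern: a date such as 2014-03-09 or 20140309 inside a file name.
_DATE_IN_NAME = re.compile(r"(?<!\d)(\d{4})[-_.]?(\d{2})[-_.]?(\d{2})(?!\d)")


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files do not need to fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def timestamp_from_name(name):
    """Return a datetime parsed from the file name, or None if none is found."""
    m = _DATE_IN_NAME.search(name)
    if not m:
        return None
    try:
        return datetime(int(m.group(1)), int(m.group(2)), int(m.group(3)))
    except ValueError:  # digits matched but do not form a valid date
        return None


def best_timestamp(path, exif_time=None):
    """Fallback chain from Aim 1: EXIF, then file name, then file system."""
    if exif_time is not None:
        return exif_time, "exif"
    from_name = timestamp_from_name(os.path.basename(path))
    if from_name is not None:
        return from_name, "filename"
    # st_ctime is the creation time on Windows only.
    return datetime.fromtimestamp(os.stat(path).st_ctime), "filesystem"


def catalogue(root):
    """Walk a directory tree and return one record (dict) per file."""
    records = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            when, source = best_timestamp(path)
            records.append({
                "path": path,
                "name": name,
                "size": os.path.getsize(path),
                "sha256": sha256_of(path),
                "timestamp": when,
                "timestamp_source": source,
            })
    return records


def find_duplicates(records):
    """Aim 2: group paths by content hash; keep only hashes seen more than once."""
    by_hash = defaultdict(list)
    for rec in records:
        by_hash[rec["sha256"]].append(rec["path"])
    return {h: paths for h, paths in by_hash.items() if len(paths) > 1}
```

Running find_duplicates(catalogue(some_folder)) would give the sets of byte-identical files regardless of their names. Renamed duplicates are covered by this, but edited versions (Aim 3) are not, since any change to the content changes the hash.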
