I have just come across Perkeep and haven't downloaded anything yet as I'm 
just trying to figure out how it fits into what I'm looking for. Note that I 
don't require the complete solution but that I am just looking to 
understand what part(s) Perkeep can manage for me. At this stage I think 
the only part is that of Aim 5, but I could be wrong...

My current set-up...

I am a home user running a Windows Server 2012 R2 machine with close to 
12TB of personal data that I need to deduplicate as well as organise and 
gain remote access to. I also have some data on Dropbox and Mega that I 
need to consolidate with the rest of my data. Some of the data across all 
of these systems is currently a mess, with duplicated files, duplicated but 
renamed files, and variations of files (edited versions, etc.).

My long-term aims are as follows...

Aim 1: Catalogue/preprocess all files to generate a database of paths, file 
names, hashes, time-stamps, and other metadata - when failing to locate a 
high-level metadata attribute, the process could be scripted to fall back 
on the next best thing, for example, if lacking EXIF timestamp, then try to 
evaluate the timestamp from the file name, and, if that fails, use the file 
created time-stamp whilst also allowing for user input and user-assigned 
metadata and tags to aid the process.
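
To make the fall-back chain in Aim 1 concrete, here is a minimal Python 
sketch. It is only an illustration under my own assumptions and not 
something Perkeep provides: the function names, the file-name pattern, and 
the exif_timestamp parameter (standing in for a value read by a separate 
EXIF library) are all hypothetical.

```python
import hashlib
import os
import re
from datetime import datetime

# Hypothetical pattern for names such as IMG_20150314_092653.jpg
# or 2015-03-14_09.26.53.png; real collections will need more patterns.
NAME_TS = re.compile(
    r"(\d{4})[-_]?(\d{2})[-_]?(\d{2})[-_ T]?(\d{2})[-_.]?(\d{2})[-_.]?(\d{2})"
)


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are never loaded whole."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def timestamp_from_name(name):
    """Return a datetime parsed from the file name, or None."""
    m = NAME_TS.search(name)
    if not m:
        return None
    try:
        return datetime(*map(int, m.groups()))
    except ValueError:  # digits matched but are not a valid date/time
        return None


def best_timestamp(path, exif_timestamp=None):
    """Fall back from EXIF, to file name, to file system time."""
    if exif_timestamp is not None:
        return exif_timestamp, "exif"
    from_name = timestamp_from_name(os.path.basename(path))
    if from_name is not None:
        return from_name, "filename"
    # getctime is the creation time on Windows; on Unix it is the
    # time of the last metadata change instead.
    return datetime.fromtimestamp(os.path.getctime(path)), "filesystem"


def catalogue_entry(path, exif_timestamp=None):
    """Build one database row for a file."""
    ts, source = best_timestamp(path, exif_timestamp)
    return {
        "path": os.path.abspath(path),
        "name": os.path.basename(path),
        "size": os.path.getsize(path),
        "sha256": sha256_of(path),
        "timestamp": ts,
        "timestamp_source": source,
    }
```

Recording timestamp_source alongside the value keeps it visible which 
entries rest on a weaker guess and may need user input later.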

Aim 2: Identify duplicate file sets in the database using the preprocessed 
information allowing for user input and user-assigned metadata and tags to 
aid the process and to then include this information in the database.
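
For exact duplicates (including renamed copies), the grouping step in Aim 2 
could look roughly like the Python sketch below. The function names are 
hypothetical and the approach is my assumption of a reasonable starting 
point: group by size first so only files that could possibly match get 
hashed, then group by SHA-256. It does not handle edited versions, which 
are the subject of Aim 3.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """SHA-256 of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def duplicate_sets(paths):
    """Return lists of paths whose contents are byte-for-byte identical."""
    # Cheap first pass: files of different sizes cannot be duplicates.
    by_size = defaultdict(list)
    for p in paths:
        by_size[os.path.getsize(p)].append(p)

    # Expensive second pass: hash only the files that share a size.
    by_hash = defaultdict(list)
    for size, group in by_size.items():
        if len(group) < 2:
            continue
        for p in group:
            by_hash[(size, file_digest(p))].append(p)

    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

File names play no part in the comparison, so a renamed copy ends up in the 
same set as its original.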

Aim 3: Using file contents, metadata, and tags, identify version sets of 
files and include this information in the database - parts of this 
identification process can be automated, for example, an image can 
programmatically be found to be rotated and/or compressed lossily, but in 
some cases version identification would fall back on a manual 
identification process allowing for user input and user-assigned metadata 
and tags.

Aim 4: Organise files based on the database, deduplicating them in the 
process, i.e., physically and safely move and/or delete files to folders 
and file-names based on original file path(s), file name(s), metadata, and 
tags whilst allowing for user input to manage and prevent file clashes so 
the process can be executed without any issues - this process would also 
write tags to file formats that support such data whilst it could provide 
the option for sidecar metadata files and/or individual hash-tag files with 
the tag names (this is how I currently add cross-platform tag-support at 
the file system level, i.e., I create individual null-byte files named 
#<tag-name> alongside the original file for video files which are all 
placed in their individual folders to keep the tags grouped but these could 
just as well be grouped by prefixing the original file name to the 
hash-tag file name, i.e., <file-name>#<tag-name>).
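
The two hash-tag naming schemes described in Aim 4 can be sketched in 
Python as follows. This is only a rough illustration; the function names 
and the "prefixed" switch are hypothetical, and tag names are assumed to 
contain only characters that are valid in file names.

```python
from pathlib import Path


def tag_path(target, tag, prefixed=True):
    """Path of the zero-byte tag file that sits next to `target`.

    prefixed=True  gives  <file-name>#<tag-name>
    prefixed=False gives  #<tag-name>  (one folder per tagged file)
    """
    target = Path(target)
    name = f"{target.name}#{tag}" if prefixed else f"#{tag}"
    return target.with_name(name)


def add_tag(target, tag, prefixed=True):
    """Create the empty tag file; do nothing if it already exists."""
    p = tag_path(target, tag, prefixed)
    p.touch(exist_ok=True)
    return p


def list_tags(target, prefixed=True):
    """Read tags back by scanning the folder for empty tag files."""
    target = Path(target)
    prefix = f"{target.name}#" if prefixed else "#"
    return sorted(
        p.name[len(prefix):]
        for p in target.parent.iterdir()
        if p.is_file() and p.name.startswith(prefix) and p.stat().st_size == 0
    )
```

With the prefixed scheme several tagged files can share a folder, since 
each tag file carries the name of the file it belongs to.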

Aim 5: Gain cross-platform access across the Internet to all organised 
files preferably with user-based access permissions.

Yes, I realise this is way more than what Perkeep was most likely designed 
to do, but I'm interested not only in what part(s) Perkeep can solve but 
possible ideas and input people may have in solving some of the tasks I 
need to solve.


Cheers,
-Andreas
