[Mayan EDMS: 355] Re: Importing directory tree structure

James Hondo Wed, 07 Nov 2012 08:21:10 -0800

Maarten, thanks for the suggestion I will give it a try.  Roberto, any 
estimate when this might get added?  Would be really helpful not only to me 
but I'm sure that for a lot of people as well.


On an unrelated note Alfresco 4.2 came out, take a look at the release 
notes, see if you notice something :-)

James

On Friday, November 2, 2012 12:02:03 PM UTC-4, Roberto Rosario wrote:
>
> Thanks Maarten that's actually the way it could be accomplished with the 
> current code.  The TO DO list is quite extensive so I can't promise a date 
> for this, but based on Maarten's script idea and adding 3 new options 
> to the bulk_upload command this could be done automatically.
>
> --dir_regex_metadata: to interpret a document's path as metadata values 
> using regular expression 
> --from_path: to import documents from a directory instead of a zipped file
> --recursive: to traverse all subdirectories
>
> so for something like:
>
> /project number 01/customer/Customer Name A/
> /project number 01/customer/Customer Name B/
> /project number 02/customer/Customer Name C/
>
> My regex Fu is poor but the command line would be more or less like this:
>
> $ ./manage.py bulk_upload --noinput --dir_regex_metadata "/project number 
> (?P<project_number>\d+)/customer/(?P<customer_name>[a-zA-Z0-9 
> ]+)"  --document_type "Accounting documents" --from_path 
> /var/accounting/docs/ --recursive
>
> with the regular expression parameter name (the name inside ?P<  >) being 
> the internal metadata type name.
>
> Thoughts?
>
> --Roberto
>
>
> On Friday, November 2, 2012 3:36:01 AM UTC-4, maarten wrote:
>>
>> Hi Roberto and James,
>>  
>> I agree on Roberto's statement on the filesystem. That is exactly the 
>> reason why filesystem directory trees always end in a mess; as soon as more 
>> than one directory (='metadata') can be applicable users will choose 
>> randomly. In case of a DMS you can then just use a second, third metadata 
>> tag.
>> However, I do understand James' problem, very common in a transition, as 
>> the best 'metadata' he currently has is the interpretation of the directory 
>> tree. E.g. a structure "project number XX\sales\quotations" are in fact 3 
>> metadata tags to be filled.
>>  
>> You could try a script where the arguments for -- metadata at bulk_upload 
>> are filled by some logic from a ls command. I've never tried (and am really 
>> not enough up to speed on pyhton if it wouldn't work to achieve this) but 
>> this command with multiple --metadata arguments could then do the trick if 
>> ran per subfolder:
>>  
>> $ ./manage.py bulk_upload --noinput --metadata '{"project": "bulk"}' 
>> --document_type "Accounting documents" compressed.zip
>>  
>> After this the magick of indexes could rebuild the original directory 
>> structure but then with many more usefull cross-sections of your document 
>> metadata.
>>  
>> Maarten
>>  
>>
>> On Wednesday, October 24, 2012 12:31:14 AM UTC+2, Roberto Rosario wrote:
>>
>>> No, the directory structure would not be 'cloned'.  I added this to the 
>>> TO DO list for future versions, but I'm a little hesitant to add it, 
>>> because it would just be duplicating the inefficient paradigm of filesystem 
>>> directory trees only on a web interface.  This is the reason I created the 
>>> automatic indexing where Mayan creates a hierarchical structure based on 
>>> user defined rules to help users avoid being slaves of a manually updated 
>>> structure as other DMS software do.  Is a little work at the beginning 
>>> while you create the rules, but then you don't have to ever worry again 
>>> about documents being placed in the correct hierarchical unit.
>>>
>>> It is not an acusasion, is it a document fact: 
>>> http://news.cnet.com/8301-1023_3-57514677-93/corruption-in-wikiland-paid-pr-scandal-erupts-at-wikipedia/
>>>  It 
>>> has long been suspected that this was happening, the episode in the link is 
>>> the most documented and alarming as it was done by a very senior Wikipedia 
>>> editor.  The editor received payment to edit and favor the page of the 
>>> government of Gibraltar so that it would be featured in Wikipedia's front 
>>> page.  An article is lucky to land on the front page, the page of the 
>>> country of Gibraltar landed 17 times, boosting their SEO results sky high. 
>>>  What is most disgusting is how the editor involved and others argue that 
>>> getting paid for favorably editing (or dumbing down a competitor's article) 
>>> is not a conflict of interest!
>>>
>>> With this information in mind, now read Mayan's previous wikipedia 
>>> article discussion here: 
>>> http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion/Mayan_edms  
>>> It is extensive but you can clearly see how: 1) The article was tagged for 
>>> deletion from the start even when articles are usually moved to an 
>>> 'inactive' (userfied) mode where defenders can keep improving it at 
>>> resubmit it for evaluation again, 2) The criteria for deletion was produced 
>>> out of thin air, 3) The existing articles for commercial DMS software would 
>>> fail that same criteria, 4) The editors were not following Wikipedia's code 
>>> of conduct (accussing new users of being SPA's) , 5) Confused the issue on 
>>> purpose, mixing defense of the article with 
>>> WP:OTHERSTUFFEXISTS<http://en.wikipedia.org/wiki/Wikipedia:OTHERSTUFFEXISTS>
>>>  to 
>>> invalidate defense, 6) Editors had no idea what a DMS software is and 
>>> confused it with CMS software; and tell me that it is hard to deny how the 
>>> editors appeared to be personally motivated beyond their duties as editors 
>>> to erradicate Mayan article from Wikipedia.
>>>
>>> I'm not trying to be controversial, just answering your argument and 
>>> explaining my desires for not wanting/caring about a Mayan EDMS article on 
>>> Wikipedia.
>>>
>>> --Roberto
>>>
>>>
>>> On Tuesday, October 23, 2012 1:44:13 PM UTC-4, James Hondo wrote:
>>>>
>>>> We are a small accounting firm and have a Windows server working as 
>>>> fileserver with all of our clients' documents sorted by year, month, 
>>>> activities and such.  My question is; Can I import not only the documents, 
>>>> but also their existing directory structure?
>>>>
>>>> Wow I don't always agree with the veteran editor's decision but calling 
>>>> them corrupt is a very heavy handed and strong worded accusation.  Still I 
>>>> think Mayan is a great piece of software with a great community and worthy 
>>>> of an article in Wikipedia, just something to consider. 
>>>>
>>>> On Tuesday, October 23, 2012 3:35:11 AM UTC-4, Roberto Rosario wrote:
>>>>>
>>>>> Hi James,
>>>>>
>>>>> Thanks I apreciate your comments :)
>>>>>
>>>>> Check this thread to see if this is more or less what your are 
>>>>> interested in: 
>>>>> https://groups.google.com/forum/?fromgroups=&pli=1#!topic/mayan-edms/M_S5ZSVV5U4%5B1-25%5D
>>>>>
>>>>> As far as I know there are no Mayan EDMS articles on Wikipedia, there 
>>>>> was one try once and the article got deleted by the most ridiculous of 
>>>>> excuses, it became clear that the editors evaluating the article were 
>>>>> seriously biased against Mayan for what I can only think were monetary 
>>>>> reasons.  Wikipedia as an idea is great, but the project has fallen from 
>>>>> grace, there are very serious moderation and vandalism issues that are as 
>>>>> old as the project and that they have not been able to address.  I don't 
>>>>> have any interest for an article about Mayan on Wikipedia.  Sorry if that 
>>>>> sounds bit harsh since you are just offering to help, I just want to save 
>>>>> you the time and effort of building and defending a great article only to 
>>>>> have corrupt editors delete it once you comply with the self serving 
>>>>> objections they will produce.  I wholeheartedly thank you for your 
>>>>> interest, but it is not worth your time.
>>>>>
>>>>> --Roberto
>>>>>
>>>>>
>>>>> On Monday, October 22, 2012 9:55:58 PM UTC-4, James Hondo wrote:
>>>>>>
>>>>>> Hello, thanks a lot for releasing your software, it is great!  I have 
>>>>>> been looking for something like it for a long time, it does everything I 
>>>>>> needed and then more.  One thing I couldn't found on the documentation; 
>>>>>> can 
>>>>>> it automatically mirror the structure of the document directories when 
>>>>>> doing an initial import?  Also I noticed the Wikipedia article is 
>>>>>> missing a 
>>>>>> great deal of stuff, I've worked on a few articles myself and would 
>>>>>> gladly 
>>>>>> help polish Mayan's article if you like.
>>>>>>
>>>>>> James
>>>>>>
>>>>>

--

[Mayan EDMS: 355] Re: Importing directory tree structure

Reply via email to