Thanks for all the help
Marc
----- Original Message -----
From: "Mark Lindner" <[EMAIL PROTECTED]>
To: "RBASE-L Mailing List" <[email protected]>
Sent: Monday, June 09, 2008 8:58 PM
Subject: [RBASE-L] - RE: Saving Scanned Docs
I store the file info in the database along with a description of the
document, the actual file# is our internally assigned file#/account
number
and a sequence number to make it unique. They are stored in a series of
directories that have 500 in each and then automatically increment to
create
a new one every 500.
The pdf's are scanned into a temp directory, then viewed, indexed and put
into the storage directory. The indexing step allows us to have all
kinds
of documents with varying descriptions found in no time at all.
Works like a charm and is virtually unlimited in size.
Samples available on request.
Mark Lindner
Lindner & Associates PC
254 Second Ave
Needham MA 02494
781 247 1100 Fax 781 247 1143
-----Original Message-----
From: [email protected] [mailto:[EMAIL PROTECTED] On Behalf Of MikeB
Sent: Monday, June 09, 2008 6:34 PM
To: RBASE-L Mailing List
Subject: [RBASE-L] - RE: Saving Scanned Docs
Emmitt, etal,
For my money, as long as I am using a DB, I would also store the relative
path
to every file and not beat myself up too much over folder relationships,
maybe
using a numbering system and incrementing when a tipping point is met
with
respect to the file count (if you already have a large number of files,
you
could test search speed against count to get what works for you).
If the file location is going to be sourced from your db, the count won't
make
much difference, but to external programs it will.
There are some pretty easy little snippets of code to generate the
pathname
of
a file if you have a need to traverse the directory/file structure
periodically
to reaffirm the pathnames.
Here are a couple of links relating to the NTFS and issues of capicity
and
look-up performance. Articles that you might find on "Optimizing NTFS
Performance" that relate to WinXP also are relative to Server 2003.
Also after reviewing the material, you might consider the wisdom of
dedicating
a disk/volume to the archiving process since this is of little impact
from a
cost perspective.
<:http://technet2.microsoft.com/WindowsServer/en/Library/8cc5891d-bf8e-4164-
862d-dac5418c59481033.mspx?mfr=true>
<:http://www.microsoft.com/technet/archive/winntas/tips/winntmag/optntfs.msp
x?mfr=true>
----- Original Message -----
From: "Emmitt Dove" <[EMAIL PROTECTED]>
To: "RBASE-L Mailing List" <[email protected]>
Sent: Monday, June 09, 2008 5:59 PM
Subject: [RBASE-L] - RE: Saving Scanned Docs
It is still better than one folder with 20,000 files. Where the
operating system has a problem in my experience is with one folder and
a very large number of files in that one folder.
Emmitt Dove
Manager, DairyPak Business Systems
Evergreen Packaging, Inc.
[EMAIL PROTECTED]
[EMAIL PROTECTED]
(203) 643-8022
From: [email protected] [mailto:[EMAIL PROTECTED] On Behalf Of Marc
Sent: Monday, June 09, 2008 5:16 PM
To: RBASE-L Mailing List
Subject: [RBASE-L] - RE: Saving Scanned Docs
Hi Emmitt
If you had
C:\ScanFolder\JohnSmith\ with 4-5 files
C:\ScanFolder\BillyJoe\ with 4-5 files
...
...
for say 5,000 customers / sub directories will that still bog it down
since there are 5000 folders and 20,000 files under the main folder
ScanFolder ?
Thanks for all the help
Marc
----- Original Message -----
From: Emmitt Dove <mailto:[EMAIL PROTECTED]>
To: RBASE-L Mailing List <mailto:[email protected]>
Sent: Monday, June 09, 2008 2:56 PM
Subject: [RBASE-L] - RE: Saving Scanned Docs
Marc,
Part of the problem is that when a Windows folder gets over some large
number of files, say 16,000 or more, the operating system's directory
search function - which all programs use - starts to bog down.
Keeping the number of files in a given folder to a manageable number
can only help performance, not to mention programmer sanity when
trying to find something or diagnose a problem.
Emmitt Dove
Manager, DairyPak Business Systems
Evergreen Packaging, Inc.
[EMAIL PROTECTED]
[EMAIL PROTECTED]
(203) 643-8022
From: [email protected] [mailto:[EMAIL PROTECTED] On Behalf Of Marc
Sent: Monday, June 09, 2008 3:35 PM
To: RBASE-L Mailing List
Subject: [RBASE-L] - RE: Saving Scanned Docs
Frank
How were you retrieving the scanned files when it was slow?
I am using a File List box and double clicking on the file.
I am worried my users do not know enough about computers
to save the file in the correct customer folder.
Thanks for the help
Marc
----- Original Message -----
From: Frank Taylor <mailto:[EMAIL PROTECTED]>
To: RBASE-L Mailing List <mailto:[email protected]>
Sent: Monday, June 09, 2008 8:50 AM
Subject: [RBASE-L] - RE: Saving Scanned Docs
Marc,
We are scanning up to 5000 documents for our customers per month. Each
one does have a unique identifier, but when we used to hold them in
one folder we found speed in retrieving them was getting very slow. We
now break them down by month/year and put them in the respective
folder of their creation date. Since then things have gotten much
faster and easier to pull up. So I would recommend some sort of
breakdown scheme to store them into different folders.
n Frank
Frank Taylor - Information Systems Administrator
F.J. O'Hara & Sons, Inc - Araho Transfer Inc.
Boston, MA - Rockland, ME - Miami, FL
Direct Dial - 617-790-3093
email: [EMAIL PROTECTED]
From: [email protected] [mailto:[EMAIL PROTECTED] On Behalf Of Marc
Sent: Monday, June 09, 2008 9:07 AM
To: RBASE-L Mailing List
Subject: [RBASE-L] - Saving Scanned Docs
HI
I need to save about 4-6 scanned documents for about 5,000
customers
Would it be better to create a folder for each customer such as
c:\Scanned Files\Cust 123\
c:\Scanned Files\Cust 124\
c:\Scanned Files\Cust 125\
or
Have all the scanned files in one folder ?
Any suggestions on how to orginize 30,000 scanned files?
I remember years ago there were limits to the number of files per
folder but I have not heard anything about that the last few years
Thanks
Marc
______________________________________________________________________
_
This Email has been scanned for all viruses by PAETEC Email Scanning
Services, utilizing MessageLabs proprietary SkyScan infrastructure. If
you
need assistance, please contact the Systems Administrator, Frank Taylor
at
[EMAIL PROTECTED] - 617-790-3093.
** REMEMBER NEVER OPEN AN ATTACHMENT UNLESS YOU ARE SURE OF IT'S CONTENT
AND
SENDER **
_______________________________________________________________________