Now this isn't a direct answer to your problem.... However you might want to
try building some hierarchy to your directory tree. Something like
/year/month/day. So today's directory would be /2010/11/17/. This will avoid
putting many entries in one directory.

As a historical note, the original UNIX file system was designed as a
hierarchy so the phone book could be stored with the higher level
directories having names related to the first few letters of the name being
stored.

-Jeff


On Wed, Nov 17, 2010 at 1:12 AM, Tim Nufire <[email protected]>
wrote:
> All,
> A quick update on this issue... Our application creates 1 new top level
> directory each day and after about 500 days *all* of the servers I've
> checked have corrupt root nodes. Even more troubling, after we repair a
> volume by running jfs_jfsck and recovering data from lost+found (see
below),
> the problem re-occurs after about a month of creating new directories.
> However, if no new top level directories are created, and only changes
lower
> down in the hierarchy are made, the problem does not reoccur.
> Does anyone have any theories about what is going on here? Is there
anything
> we can do to prevent this from happening? Would moving all the data down
one
> level (e.g. nested in a single root directory) help or is the root node
like
> any other node and 500+ nested directories at any level too much for JFS?
> Because these are older machines, they are all running Debian 4 with a
> backported 2.6.26 kernel.. Is there any chance upgrading to Debian 5 and a
> newer kernel would help?
> Thanks in advance for any help :-)
> Tim
> On Aug 25, 2010, at 2:16 PM, Tim Nufire wrote:
>
> Hello,
> I've got a problem that I'm hoping someone on this list can help me
with...
> Read-only fsck.jfs checks on my oldest volumes are reporting an alarming
> number of corrupted root nodes despite the fact that these volumes appear
to
> be healthy when mounted read-only. Here's the error that I'm getting...
> fsck.jfs -n -v /dev/md/10
> fsck.jfs version 1.1.14, 06-Apr-2009
> processing started: 8/13/2010 10.9.6
> The current device is:  /dev/md/10
> Open(...READONLY...) returned rc = 0
> Primary superblock is valid.
> The type of file system for the device is JFS.
> Block size in bytes:  4096
> Filesystem size in blocks:  4756914448
> **Phase 1 - Check Blocks, Files/Directories, and  Directory Entries
> Invalid data format detected in root directory.
> CANNOT CONTINUE.
> ERRORS HAVE BEEN DETECTED.  Run fsck with the -f parameter to repair.
> processing terminated:  8/13/2010 10:10:05  with return code: 10062  exit
> code: 4.
> Despite the catastrophic sounding error above, mounting the file system
> read-only and listing the directory from the command-line works fine....
> ls
> 20090110  20090303  20090418  20090605  20090721  20090914  20091030
>  20091215  20100130  20100317  20100502  20100617
> 20090111  20090304  20090419  20090606  20090722  20090915  20091031
>  20091216  20100131  20100318  20100503  20100618
> 20090113  20090305  20090420  20090607  20090723  20090916  20091101
>  20091217  20100201  20100319  20100504  20100619
> 20090114  20090306  20090421  20090608  20090724  20090917  20091102
>  20091218  20100202  20100320  20100505  20100620
> 20090115  20090307  20090422  20090609  20090725  20090918  20091103
>  20091219  20100203  20100321  20100506  20100622
> 20090116  20090308  20090423  20090610  20090727  20090919  20091104
>  20091220  20100204  20100322  20100507  20100623
> 20090117  20090309  20090424  20090611  20090728  20090920  20091105
>  20091221  20100205  20100323  20100508  20100624
> 20090118  20090310  20090425  20090612  20090729  20090921  20091106
>  20091222  20100206  20100324  20100509  20100625
> 20090119  20090311  20090426  20090613  20090730  20090922  20091107
>  20091223  20100207  20100325  20100510  20100626
> 20090120  20090312  20090427  20090614  20090731  20090923  20091108
>  20091224  20100208  20100326  20100511  20100627
> 20090121  20090313  20090428  20090615  20090801  20090924  20091109
>  20091225  20100209  20100327  20100512  20100628
> 20090122  20090314  20090429  20090616  20090802  20090925  20091110
>  20091226  20100210  20100328  20100513  20100629
> 20090123  20090315  20090430  20090617  20090803  20090926  20091111
>  20091227  20100211  20100329  20100514  20100630
> 20090126  20090316  20090501  20090618  20090804  20090927  20091112
>  20091228  20100212  20100330  20100515  20100701
> 20090127  20090317  20090502  20090619  20090805  20090928  20091113
>  20091229  20100213  20100331  20100516  20100702
> 20090128  20090318  20090503  20090620  20090809  20090929  20091114
>  20091230  20100214  20100401  20100517  20100703
> 20090129  20090319  20090504  20090621  20090810  20090930  20091115
>  20091231  20100215  20100402  20100518  20100704
> 20090130  20090320  20090505  20090622  20090811  20091001  20091116
>  20100101  20100216  20100403  20100519  20100705
> 20090202  20090321  20090506  20090623  20090812  20091002  20091117
>  20100102  20100217  20100404  20100520  20100706
> 20090204  20090322  20090507  20090624  20090813  20091003  20091118
>  20100103  20100218  20100405  20100521  20100707
> 20090205  20090323  20090508  20090625  20090814  20091004  20091119
>  20100104  20100219  20100406  20100522  20100708
> 20090206  20090324  20090509  20090626  20090815  20091005  20091120
>  20100105  20100220  20100407  20100523  20100709
> 20090207  20090325  20090510  20090627  20090816  20091006  20091121
>  20100106  20100221  20100408  20100524  20100710
> 20090208  20090326  20090511  20090628  20090817  20091007  20091122
>  20100107  20100222  20100409  20100525  20100711
> 20090209  20090327  20090512  20090629  20090818  20091008  20091123
>  20100108  20100223  20100410  20100526  20100712
> 20090210  20090328  20090513  20090630  20090819  20091009  20091124
>  20100109  20100224  20100411  20100527  20100713
> 20090211  20090329  20090514  20090701  20090820  20091010  20091125
>  20100110  20100225  20100412  20100528  20100714
> 20090212  20090330  20090515  20090702  20090821  20091011  20091126
>  20100111  20100226  20100413  20100529  20100715
> 20090213  20090331  20090516  20090703  20090822  20091012  20091127
>  20100112  20100227  20100414  20100530  20100716
> 20090214  20090401  20090517  20090704  20090823  20091013  20091128
>  20100113  20100228  20100415  20100531  20100717
> 20090215  20090402  20090518  20090705  20090824  20091014  20091129
>  20100114  20100301  20100416  20100601  20100718
> 20090216  20090403  20090519  20090706  20090825  20091015  20091130
>  20100115  20100302  20100417  20100602  20100719
> 20090217  20090404  20090520  20090707  20090826  20091016  20091201
>  20100116  20100303  20100418  20100603  20100720
> 20090218  20090405  20090521  20090708  20090827  20091017  20091202
>  20100117  20100304  20100419  20100604  20100721
> 20090219  20090406  20090522  20090709  20090828  20091018  20091203
>  20100118  20100305  20100420  20100605  20100722
> 20090220  20090407  20090523  20090710  20090901  20091019  20091204
>  20100119  20100306  20100421  20100606  20100723
> 20090221  20090408  20090524  20090711  20090902  20091020  20091205
>  20100120  20100307  20100422  20100607  20100724
> 20090222  20090409  20090527  20090712  20090903  20091021  20091206
>  20100121  20100308  20100423  20100608  20100725
> 20090223  20090410  20090528  20090713  20090904  20091022  20091207
>  20100122  20100309  20100424  20100609  20100726
> 20090224  20090411  20090529  20090714  20090905  20091023  20091208
>  20100123  20100310  20100425  20100610  20100727
> 20090225  20090412  20090530  20090715  20090906  20091024  20091209
>  20100124  20100311  20100426  20100611  20100728
> 20090226  20090413  20090531  20090716  20090907  20091025  20091210
>  20100125  20100312  20100427  20100612  20100729
> 20090227  20090414  20090601  20090717  20090908  20091026  20091211
>  20100126  20100313  20100428  20100613  mount_check
> 20090228  20090415  20090602  20090718  20090909  20091027  20091212
>  20100127  20100314  20100429  20100614
> 20090301  20090416  20090603  20090719  20090912  20091028  20091213
>  20100128  20100315  20100430  20100615
> 20090302  20090417  20090604  20090720  20090913  20091029  20091214
>  20100129  20100316  20100501  20100616
> Running fsck.jfs read-wrirte re-initiallizes the root node and moves all
of
> its former contents into lost+found. I can recover the data from
lost+found
> so this is not fatal but still something I would like to fix/avoid.
> I have not repaired the above volume yet but have repaired others...
Here's
> the fsck.jfs output for a read-write repair on a volume that had the same
> errors as those described above.
> fsck.jfs -v /dev/md10
> fsck.jfs version 1.1.14, 06-Apr-2009
> processing started: 4/23/2010 4.32.24
> Using default parameter: -p
> The current device is:  /dev/md10
> Open(...READ/WRITE EXCLUSIVE...) returned rc = 0
> Primary superblock is valid.
> The type of file system for the device is JFS.
> Block size in bytes:  4096
> Filesystem size in blocks:  4756914448
> **Phase 0 - Replay Journal Log
> LOGREDO:  Log record for Sync Point at:    0x05774f34
> LOGREDO:  Beginning to update the Inode Allocation Map.
> LOGREDO:  Done updating the Inode Allocation Map.
> LOGREDO:  Beginning to update the Block Map.
> LOGREDO:  Incorrect leaf index detected (k=(d) 0, j=(d) 0, idx=(d) 0)
while
> writing Block Map.
> LOGREDO:  Write Block Map control page failed in UpdateMaps().
> LOGREDO:  Unable to update map(s).
> logredo failed (rc=-231).  fsck continuing.
> **Phase 1 - Check Blocks, Files/Directories, and  Directory Entries
> Root directory has a corrupt tree.
> Initialized tree created for root directory.
> The root directory has an invalid data format.  Will correct.
> **Phase 2 - Count links
> **Phase 3 - Duplicate Block Rescan and Directory Connectedness
> **Phase 4 - Report Problems
> **Phase 5 - Check Connectivity
> **Phase 6 - Perform Approved Corrections
> Superblock marked dirty because repairs are about to be written.
> No \lost+found directory found in the filesystem.
> Directory inode  18661404 has been reconnected to /lost+found/.
> Directory inode  18637982 has been reconnected to /lost+found/.
> Directory inode  18614880 has been reconnected to /lost+found/.
> Directory inode  18595359 has been reconnected to /lost+found/.
> Directory inode  18581312 has been reconnected to /lost+found/.
> Directory inode  18556038 has been reconnected to /lost+found/.
> .
> .
> .
> Directory inode  448971 has been reconnected to /lost+found/.
> File inode  443531 has been reconnected to /lost+found/.
> Directory inode  442414 has been reconnected to /lost+found/.
> .
> .
> .
> Directory inode  2320 has been reconnected to /lost+found/.
> Directory inode  101 has been reconnected to /lost+found/.
> Directory inode  32 has been reconnected to /lost+found/.
> 622 directories reconnected to /lost+found/.
> 1 file reconnected to /lost+found/.
> **Phase 7 - Rebuild File/Directory Allocation Maps
> **Phase 8 - Rebuild Disk Allocation Maps
> **Phase 9 - Reformat File System Log
> logformat returned rc = 0
> Filesystem Summary:
> Blocks in use for inodes:  2276956
> Inode count:  18215648
> File count:  16453081
> Directory count:  1529882
> Block count:  4756914448
> Free block count:  655162544
> 19027657792 kilobytes total disk space.
>   6342069 kilobytes in 1529882 directories.
> 16397493672 kilobytes in 16453081 user files.
>         0 kilobytes in extended attributes
>         0 kilobytes in access control lists
>  15856013 kilobytes reserved for system use.
> 2620650176 kilobytes are available for use.
> Filesystem is clean.
> All observed inconsistencies have been repaired.
> Filesystem has been marked clean.
> **** Filesystem was modified. ****
> processing terminated:  4/23/2010 9:08:55  with return code: 0  exit code:
> 1.
> This problem appears to be related to age and/or the number of directories
> in the root node. It's hard to distinguish between these two attributes in
> our environment because the root node of our data volumes contain one
> directory for each day the volume has been in use. The tipping point
appears
> to be around 500 days/directories.
> Is this a known issue? Is there really a problem  with the root node or
does
> fsck.jfs have an analysis bug? In any event, since the OS can list the
> contents of the root node, fsck.jfs should be able to do better than just
> dumping all the contents into lost+found.
> I've also seen corruption in my allocation maps which could be
> related... How can I help debug this further?
> Thanks!
> Tim
>
------------------------------------------------------------------------------
> Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
> Be part of this innovative community and reach millions of netbook users
> worldwide. Take advantage of special opportunities to increase revenue and
> speed time-to-market. Join now, and jumpstart your future.
>
http://p.sf.net/sfu/intel-atom-d2d_______________________________________________
> Jfs-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/jfs-discussion
>
>
>
------------------------------------------------------------------------------
> Beautiful is writing same markup. Internet Explorer 9 supports
> standards for HTML5, CSS3, SVG 1.1,  ECMAScript5, and DOM L2 & L3.
> Spend less time writing and  rewriting code and more time creating great
> experiences on the web. Be a part of the beta today
> http://p.sf.net/sfu/msIE9-sfdev2dev
> _______________________________________________
> Jfs-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/jfs-discussion
>
>
------------------------------------------------------------------------------
Beautiful is writing same markup. Internet Explorer 9 supports
standards for HTML5, CSS3, SVG 1.1,  ECMAScript5, and DOM L2 & L3.
Spend less time writing and  rewriting code and more time creating great
experiences on the web. Be a part of the beta today
http://p.sf.net/sfu/msIE9-sfdev2dev
_______________________________________________
Jfs-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/jfs-discussion

Reply via email to