[ https://issues.apache.org/jira/browse/SOLR-207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12488469 ]
Yonik Seeley commented on SOLR-207: ----------------------------------- I tried both versions out, and the "find" version was quicker (on Linux at least). System time was about the same, but "ls" had much higher user time. $ time find . -maxdepth 1 -name 'snapshot.*' | grep -v wip | head -1 ./snapshot.20070411235957 real 0m0.009s user 0m0.002s sys 0m0.008s $ time ls -r . | grep snapshot\\. | grep -v wip | head -1 snapshot.20070412114504 real 0m0.050s user 0m0.043s sys 0m0.009s > snappuller inefficient finding latest snapshot > ---------------------------------------------- > > Key: SOLR-207 > URL: https://issues.apache.org/jira/browse/SOLR-207 > Project: Solr > Issue Type: Bug > Components: replication > Reporter: Yonik Seeley > Attachments: find_maxdepth.patch > > > snapinstaller (and snappuller) do the following to find the latest snapshot: > name=`find ${data_dir} -name snapshot.* -print|grep -v wip|sort -r|head -1` > This recurses into all of the snapshot directories, doing much more disk-io > than is necessary. > I think it is the cause of bloated kernel memory usage we have seen on some > of our Linux boxes, caused > by kernel dentry and inode caches. Those caches compete with buffer cache > (caching the actual data of the index) > and can thus decrease performance. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.