Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Todd Rme
On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin mgraess...@kde.org wrote: On Thursday 16 October 2014 13:20:57 Vishesh Handa wrote: Hey guys While Baloo performs better than Nepomuk. It does have its share of problems - mostly large text files, and high IO usage. Additionally, users on linux

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Vishesh Handa
On Fri, Oct 17, 2014 at 2:11 AM, Ömer Fadıl USTA omeru...@gmail.com wrote: Couldnt we add a .baloo file to specify for attributes for current directory, file or subdirectories about not indexing ? For example of a .baloo file skip_all skip_if_greater 1m skip_if_smaller 50k skip_ext txt jpg

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Eike Hein
On 17.10.2014 18:24, Vishesh Handa wrote: About gibberish. It's hard to figure out what gibberish is. I think I'll add some code that we only index the first 20 characters of each word. That should help to a certain extent. Define word - Chinese and Japanese (unless mostly kana) often don't

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Rex Dieter
Vishesh Handa wrote: ... Instead, we could only index - * $HOME - Not including any subfolders. * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user directories. +1 Yes, please! -- Rex ___ Plasma-devel mailing list

Re: Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Martin Gräßlin
On Friday 17 October 2014 18:24:48 Vishesh Handa wrote: On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin mgraess...@kde.org wrote: the txt being genome data doesn't surprise me[1], but I find it sad that now txt is disabled by default (I use them quite a lot for blog posts). As genome

Re: Baloo - Not Indexing everything by default

2014-10-17 Thread Mark Gaiser
On Thu, Oct 16, 2014 at 1:20 PM, Vishesh Handa m...@vhanda.in wrote: Hey guys While Baloo performs better than Nepomuk. It does have its share of problems - mostly large text files, and high IO usage. Additionally, users on linux often seem to have the craziest files. Currently, we do not

Baloo - Not Indexing everything by default

2014-10-16 Thread Vishesh Handa
Hey guys While Baloo performs better than Nepomuk. It does have its share of problems - mostly large text files, and high IO usage. Additionally, users on linux often seem to have the craziest files. Currently, we do not index plain text files which do not have a `.txt` extension, because

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Marco Martin
On Thursday 16 October 2014, Vishesh Handa wrote: * $HOME - Not including any subfolders. * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user directories. Gnome Tracker actually does something quite similar. Comments? +1 i tend to prefer a whitelist on what to

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread David Edmundson
Seems a shame to do this now that things are working so well. Since the .txt only change Baloo hasn't bothered me at all. David ___ Plasma-devel mailing list Plasma-devel@kde.org https://mail.kde.org/mailman/listinfo/plasma-devel

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 13:20:57 schrieb Vishesh Handa: Hey guys Hi Vishesh, While Baloo performs better than Nepomuk. It does have its share of problems - mostly large text files, and high IO usage. Additionally, users on linux often seem to have the craziest files. Currently, we

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 13:27:02 schrieb Marco Martin: On Thursday 16 October 2014, Vishesh Handa wrote: * $HOME - Not including any subfolders. * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user directories. Gnome Tracker actually does something

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Kai Uwe Broulik
Hi, * Desktop, Documents, Videos, Pictures and Music. All of these are xdg user directories. The only reason I actually index my kf5 folder with all the git clones in it, is because Dolphin doesn't properly fallback when searching non-indexed locations and then it won't find anything (in

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Eike Hein
On 16.10.2014 13:20, Vishesh Handa wrote: Comments? I understand the pragmatic motivation behind it, but it seems like a strange step to me. The idea behind indexing is that you can find things regardless of location, so you don't need to be aware of where things are. By making the index

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Gräßlin
On Thursday 16 October 2014 13:20:57 Vishesh Handa wrote: Hey guys While Baloo performs better than Nepomuk. It does have its share of problems - mostly large text files, and high IO usage. Additionally, users on linux often seem to have the craziest files. Currently, we do not index plain

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Luca Beltrame
In data giovedì 16 ottobre 2014 14:15:15, Martin Gräßlin ha scritto: genome data is really huge wouldn't it make sense to go rather for file size or abort the indexing if it's obvious random gibberish? As the person who mentioned this first (hey, I'm famous ;), I'm guessing that limiting on

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Martin Steigerwald
Am Donnerstag, 16. Oktober 2014, 14:20:06 schrieb Luca Beltrame: In data giovedì 16 ottobre 2014 14:15:15, Martin Gräßlin ha scritto: genome data is really huge wouldn't it make sense to go rather for file size or abort the indexing if it's obvious random gibberish? As the person who

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Weng Xuetian
As for text file, in linux world people don't usually use .txt extension, especially when writing something like vimwiki or something similar. I guess cap the size is some what better solution ( 1-5MB is good enough). And as for folder limitation, that doesn't sound good, people usually organze

Re: Baloo - Not Indexing everything by default

2014-10-16 Thread Ömer Fadıl USTA
Couldnt we add a .baloo file to specify for attributes for current directory, file or subdirectories about not indexing ? For example of a .baloo file skip_all skip_if_greater 1m skip_if_smaller 50k skip_ext txt jpg With --subdirs flag like Skip_ext --subdirs txt jpg Skip_all --subdirs On Oct 16,