Hello Gus!

I already knew about grep.app and have tested it. It is definitely great
software!

I need my own search engine because:
- I need regex support on ALL searches, similar to what Sourcegraph already
does
- I plan to customize the search query so that I can add CodeQL rules in the
distant future
- I need to select a specific range of indexed repos
- sometimes I also need more repos than the ones that have already been
indexed
- I prefer to use only open source software in case of issues in the future,
and to avoid depending on a single tool or infrastructure
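
To make the first and third points concrete, here is a rough sketch, not a
tested setup. It assumes the repos are already cloned locally and that each
source line is stored as one document in an untokenized string field. The
field names (`repo`, `path_s`, `line_s`) and both function names are
hypothetical. The query relies on the `field:/regex/` form of the standard
query parser. As far as I understand, such a regex must match the whole
indexed term and uses Lucene's regex dialect rather than PCRE, hence the
leading and trailing `.*`.

```python
from pathlib import Path
from urllib.parse import urlencode

# Hypothetical field names; a real Solr schema would have to define them.
REPO_FIELD = "repo"
PATH_FIELD = "path_s"
LINE_FIELD = "line_s"  # assumed to be an untokenized string field


def repo_to_docs(repo_name, repo_dir, suffixes=(".py", ".c", ".java")):
    """Yield one document per source line of an already cloned repo."""
    root = Path(repo_dir)
    for path in sorted(root.rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        rel = path.relative_to(root).as_posix()
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            yield {
                "id": f"{repo_name}:{rel}:{number}",
                REPO_FIELD: repo_name,
                PATH_FIELD: rel,
                LINE_FIELD: line,
            }


def regex_query_params(pattern, repos):
    """Build query-string parameters for a regex search limited to some repos."""
    # '/' delimits the regex in the query syntax, so escape it in the pattern.
    escaped = pattern.replace("/", "\\/")
    repo_filter = " OR ".join(f'"{name}"' for name in repos)
    return urlencode({
        "q": f"{LINE_FIELD}:/{escaped}/",
        "fq": f"{REPO_FIELD}:({repo_filter})",
    })
```

For example, `regex_query_params(r".*strcpy\(.*", ["demo"])` would give the
parameters for finding `strcpy(` calls in the repo named `demo`, and the
documents from `repo_to_docs` could be sent to Solr as JSON.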

If I could contact your friend, it would be a great help! It's a small
world! :)

On Wed, Jan 8, 2025 at 5:41 PM Gus Heck <gus.h...@gmail.com> wrote:

> Perhaps you're looking for https://grep.app/ ? It does regex search vs
> github and was recently acquired by Vercel. It was written by a friend of
> mine.
>
> On Wed, Jan 8, 2025 at 9:44 AM anon anon <anonimoussech...@gmail.com>
> wrote:
>
> > Markus: I probably misunderstood your remark.
> >
> > Would it be possible to use a git clone protocol plugin, please?
> >
> > On Wed, Jan 8, 2025 at 3:41 PM anon anon <anonimoussech...@gmail.com>
> > wrote:
> >
> > > David:
> > >
> > > I would also like to make sure I explained my need clearly.
> > >
> > > I absolutely need to index source code into my personal search engine so
> > > that I can run regexes in Solr. I want to look for vulnerabilities with
> > > those regexes.
> > >
> > > Could you provide the steps to configure Nutch, and possibly Solr, for
> > > this, please?
> > >
> > > Best regards.
> > >
> > > On Wed, Jan 8, 2025 at 3:25 PM anon anon <anonimoussech...@gmail.com>
> > > wrote:
> > >
> > >> Hello David,
> > >>
> > >> I need a git "clone" indexer to index as large a database of repos as
> > >> possible, for the cyber security research I do for my job.
> > >>
> > >> Hello Markus,
> > >>
> > >> I am open to any proposition.
> > >>
> > >> I did not find in the docs how to do just a git clone of a repo URL
> > >> from the crawler indexer config regex. I also see in the source code at
> > >> https://github.com/apache/nutch/tree/master/src/plugin that the
> > >> supported protocols are listed there. I doubt I could add my own custom
> > >> protocol through configuration alone. I hope I am wrong. If you are sure
> > >> I could clone a repo directly from the Nutch config, could you tell me
> > >> how, please?
> > >>
> > >> If you really think I need to fork the repo, I can do that as well.
> > >>
> > >> Best regards.
> > >>
> > >> On Tue, Jan 7, 2025 at 4:01 PM Markus Jelsma <markus.jel...@openindex.io>
> > >> wrote:
> > >>
> > >>> Hi,
> > >>>
> > >>> Nutch is, just as Solr, highly customizable using all sorts of plugins.
> > >>> Forking it is not recommended. If you happen to come across behaviour in
> > >>> one of its tools that is not configurable, it can be made configurable.
> > >>>
> > >>> Regards,
> > >>> Markus
> > >>>
> > >>> On Tue, Jan 7, 2025 at 4:52 PM David Smiley <dsmi...@apache.org> wrote:
> > >>>
> > >>> > Forking anything is a burden on you to maintain your fork.  You didn't
> > >>> > say *why* you want to fork something instead of simply using something.
> > >>> > You mentioned adding features, but search engine platforms like Solr are
> > >>> > designed to be highly pluggable/extensible without forking.  It's a
> > >>> > platform, not a product.
> > >>> >
> > >>> > On Sun, Jan 5, 2025 at 6:36 PM anon <anonimoussech...@gmail.com> wrote:
> > >>> >
> > >>> > > Hello people!!
> > >>> > >
> > >>> > > I was going to fork Sourcegraph because I was looking for a search
> > >>> > > engine specific to source code hosted on sites such as GitHub and
> > >>> > > GitLab, with the possibility of indexing decompiled files offline.
> > >>> > > Then I read this license:
> > >>> > >
> > >>> > > https://github.com/sourcegraph/sourcegraph-public-snapshot/blob/main/LICENSE.enterprise
> > >>> > >
> > >>> > > It seems to be *more than* proprietary. Then I found OpenSearch. It
> > >>> > > seems modular. I might fork it to:
> > >>> > > 1- index only source code from GitHub/GitLab and from local sources
> > >>> > > into my instance
> > >>> > > 2- use regex and CodeQL queries in the search client.
> > >>> > >
> > >>> > > OpenSearch seems good but not modular enough.
> > >>> > >
> > >>> > >
> > >>> > > I think Solr is the best choice for me. I will complement it with a
> > >>> > > fork of Nutch.
> > >>> > >
> > >>> > > I think a Nutch fork would cover exactly what I am looking for:
> > >>> > >
> > >>> > > - it is free software
> > >>> > >
> > >>> > > - it is modular across many protocols (not git yet), and compatible
> > >>> > > with Solr
> > >>> > >
> > >>> > > I suggest that I fork Nutch to add a plugin at
> > >>> > > https://github.com/apache/nutch/tree/master/src/plugin under a new
> > >>> > > folder protocol-file, and why not let people fork it.
> > >>> > >
> > >>> > > Is it a good idea?
> > >>> > >
> > >>> > > Best regards.
> > >>> > >
> > >>> >
> > >>>
> > >>
> >
>
>
> --
> http://www.needhamsoftware.com (work)
> https://a.co/d/b2sZLD9 (my fantasy fiction book)
>
