search engines....

2000-03-22 Thread Rodrigo
Hi, I'm doing graduation work on search engines and I would like to know if you have any information (books, pages, articles) about how to create a search engine, how they work, the programs they use to classify pages, spiders and robots, agents... at the moment. thanks for any ki

Re: Asking for information

2000-03-22 Thread Avi Rappoport
At 7:58 PM -0800 3/19/2000, Jaime Gomez wrote: >I am trying to write my own web robot and I haven't found enough >information yet. If someone can tell me where can I find the information >(books, web pages) I would be very pleased. >Regards: Jaime Gomez As mentioned before, I have quite a bit of

Re: robots.txt a security hole??

2000-03-22 Thread Walter Underwood
At 7:36 AM -0500 3/22/00, Anthony Kirlew wrote: >I have heard much talk of the security issue recently. Here is one way to >get around this. Let's say you have a file called "private". You could put >it in a folder called "icons" (or some other generic name) and then do a >disallow on "/icons" th

Re: Asking for information

2000-03-22 Thread Rodrigo
Try these. Sample search engine source code is at http://xav.com/latest/search.txt and docs are at http://xav.com/scripts/search/ See also "The Anatomy of a Large-Scale Hypertextual Web Search Engine" at http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm. This might help. --- Avi Rappoport <[EMA

Re: Commerce Robots

2000-03-22 Thread Corey Schwartz
Richard, My firm is becoming actively engaged in developing a very specialized robot. It is possible that we could spin off the core technology for your project. Please contact me off list at: Corey Schwartz 602.739.7060 > -----Original Message----- > From: [EMAIL PROTECTED] [mailto:[EMAIL

Re: robots.txt a security hole??

2000-03-22 Thread Anthony Kirlew
I have heard much talk of the security issue recently. Here is one way to get around this. Let's say you have a file called "private". You could put it in a folder called "icons" (or some other generic name) and then do a disallow on "/icons"; that way you wouldn't be giving away the name of your
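
The directory-level disallow described above can be sketched with Python's standard urllib.robotparser module. This is only an illustration under stated assumptions: the host www.example.com, the user agent name "ExampleBot", and the robots.txt contents are hypothetical and do not come from the thread. Note that robots.txt is advisory; it asks well-behaved robots to stay away and does not itself restrict access.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the rule names only the generic directory,
# so the file name "private" never appears in the published file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /icons
"""


def build_parser(text):
    """Parse robots.txt text supplied as a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A compliant robot checks each URL before requesting it.
# "ExampleBot" and www.example.com are placeholders for illustration.
blocked = parser.can_fetch("ExampleBot", "http://www.example.com/icons/private")
allowed = parser.can_fetch("ExampleBot", "http://www.example.com/index.html")

print("may fetch /icons/private:", blocked)
print("may fetch /index.html:", allowed)
```

Anything under /icons is reported as off limits to a compliant robot, while the rest of the site remains fetchable. A client that ignores robots.txt can still request the file directly if it learns the path.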