On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <li...@zaunere.com> wrote:
> > Hi Friends, > > Hi, > > > Could you please tell / suggest me how to develop Plagiarism Checker > > feature or send some useful articles / free APIs and so on. > > > Seriously though, unless I'm missing something, I can't see how this would > be possible. I suppose you could use techniques such as comparing the > number of similar words between articles, but that's not really exact, and > likely to have incorrect results. Plus, you're looking to do this > plagiarism check across the whole Internet? > > My understanding is many CS professors do this for programming homework. They are looking for exact matches. Apparently that catches a lot of people. I think chopping up an article into an array of sentences, and throwing a few into google would be a good approach. Submit 25% of the sentences to google as exact phrase matches. Throw the first 10 result urls into an array for each. Sort the urls and see how many are the same. Justin
_______________________________________________ New York PHP Users Group Community Talk Mailing List http://lists.nyphp.org/mailman/listinfo/talk http://www.nyphp.org/Show-Participation