Hi Friends, Thanks for your replies.
Please have a look of the following sites http://www.copyscpe.com and http://www.articlechecker.com/. The above sites are ensured that the posted article is unique and original. The above sites are search in search engines (like google, yahoo and so on) I wish to develop the same. Please let me know the steps how to develop in php / api or so on. On Mon, May 3, 2010 at 5:08 PM, <talk-requ...@lists.nyphp.org> wrote: > Send talk mailing list submissions to > talk@lists.nyphp.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://lists.nyphp.org/mailman/listinfo/talk > or, via email, send a message with subject or body 'help' to > talk-requ...@lists.nyphp.org > > You can reach the person managing the list at > talk-ow...@lists.nyphp.org > > When replying, please edit your Subject line so it is more specific > than "Re: Contents of talk digest..." > > > Today's Topics: > > 1. Re: Plagiarism Checker in PHP (Hans Zaunere) > 2. Re: Plagiarism Checker in PHP (Mutaz Musa) > 3. Re: Plagiarism Checker in PHP (Mutaz Musa) > 4. Re: Plagiarism Checker in PHP (Justin Dearing) > 5. Re: Plagiarism Checker in PHP (Christopher R. Merlo) > 6. Password security for SQLite based pages? (Hall CTR Leam) > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Sun, 2 May 2010 13:58:31 -0400 > From: "Hans Zaunere" <li...@zaunere.com> > To: "'NYPHP Talk'" <talk@lists.nyphp.org> > Subject: Re: [nyphp-talk] Plagiarism Checker in PHP > Message-ID: <00aa01caea21$138cdee0$3aa69c...@com> > Content-Type: text/plain; charset="us-ascii" > > > Hi Friends, > > Hi, > > > I am developing blogger website using PHP & MySQL. And, I would like > > add the feature of Plagiarism Checker. > > > > If any one posted an article in my website, that article must be unique > > and original. > > > > Could you please tell / suggest me how to develop Plagiarism Checker > > feature or send some useful articles / free APIs and so on. > > Hmm, I think there is a recursion problem here - if we tell you, then you'd > be plagiarizing our solution :) > > > Expecting your positive reply ...... > > Seriously though, unless I'm missing something, I can't see how this would > be possible. I suppose you could use techniques such as comparing the > number of similar words between articles, but that's not really exact, and > likely to have incorrect results. Plus, you're looking to do this > plagiarism check across the whole Internet? > > Even humans, who read different articles, have a hard time determining if > something is actually plagiarized or not - I think doing it automatically > is > nearly impossible. > > H > > > > > ------------------------------ > > Message: 2 > Date: Sun, 2 May 2010 14:40:06 -0400 > From: Mutaz Musa <mutazm...@gmail.com> > To: NYPHP Talk <talk@lists.nyphp.org> > Subject: Re: [nyphp-talk] Plagiarism Checker in PHP > Message-ID: > <q2t602c8ac1005021140udd025bcfwbf371cea82f76...@mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Actually turnitin.com, a website where students turn in schoolwork, checks > for plagiarism and from what I understand does a pretty good job. They > compare both student-to-student and student-to-literature. I'm not sure how > they pull it off, probably a proprietary algorithm that tries to match > student submissions against their database of content where no match = no > plagiarism. It seems like a mammoth task though. Turnitin.com actually > crawls academic journals, books, etc. to populate their database. > > If you're interested only in comparing submissions to your site against one > another that may be more manageable. If on the other hand you want to > compare against online/print content in general then that will prove a real > challenge. Finally, if you have the money maybe you can register with > turnitin.com - although my impression is that their primarily for > universities/colleges. > > http://turnitin.com/static/index.html > > Best, > Mutaz > > On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <li...@zaunere.com> wrote: > > > > Hi Friends, > > > > Hi, > > > > > I am developing blogger website using PHP & MySQL. And, I would like > > > add the feature of Plagiarism Checker. > > > > > > If any one posted an article in my website, that article must be unique > > > and original. > > > > > > Could you please tell / suggest me how to develop Plagiarism Checker > > > feature or send some useful articles / free APIs and so on. > > > > Hmm, I think there is a recursion problem here - if we tell you, then > you'd > > be plagiarizing our solution :) > > > > > Expecting your positive reply ...... > > > > Seriously though, unless I'm missing something, I can't see how this > would > > be possible. I suppose you could use techniques such as comparing the > > number of similar words between articles, but that's not really exact, > and > > likely to have incorrect results. Plus, you're looking to do this > > plagiarism check across the whole Internet? > > > > Even humans, who read different articles, have a hard time determining if > > something is actually plagiarized or not - I think doing it automatically > > is > > nearly impossible. > > > > H > > > > > > _______________________________________________ > > New York PHP Users Group Community Talk Mailing List > > http://lists.nyphp.org/mailman/listinfo/talk > > > > http://www.nyphp.org/Show-Participation > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://lists.nyphp.org/pipermail/talk/attachments/20100502/7cd7f7c2/attachment-0001.html > > > > ------------------------------ > > Message: 3 > Date: Sun, 2 May 2010 14:42:05 -0400 > From: Mutaz Musa <mutazm...@gmail.com> > To: NYPHP Talk <talk@lists.nyphp.org> > Subject: Re: [nyphp-talk] Plagiarism Checker in PHP > Message-ID: > <w2n602c8ac1005021142yc63e3272ye681533738599...@mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > Also, sorry for the double post but check out http://www.ithenticate.com/ > > On Sun, May 2, 2010 at 2:40 PM, Mutaz Musa <mutazm...@gmail.com> wrote: > > > Actually turnitin.com, a website where students turn in schoolwork, > checks > > for plagiarism and from what I understand does a pretty good job. They > > compare both student-to-student and student-to-literature. I'm not sure > how > > they pull it off, probably a proprietary algorithm that tries to match > > student submissions against their database of content where no match = no > > plagiarism. It seems like a mammoth task though. Turnitin.com actually > > crawls academic journals, books, etc. to populate their database. > > > > If you're interested only in comparing submissions to your site against > one > > another that may be more manageable. If on the other hand you want to > > compare against online/print content in general then that will prove a > real > > challenge. Finally, if you have the money maybe you can register with > > turnitin.com - although my impression is that their primarily for > > universities/colleges. > > > > http://turnitin.com/static/index.html > > > > Best, > > Mutaz > > > > > > On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <li...@zaunere.com> wrote: > > > >> > Hi Friends, > >> > >> Hi, > >> > >> > I am developing blogger website using PHP & MySQL. And, I would like > >> > add the feature of Plagiarism Checker. > >> > > >> > If any one posted an article in my website, that article must be > unique > >> > and original. > >> > > >> > Could you please tell / suggest me how to develop Plagiarism Checker > >> > feature or send some useful articles / free APIs and so on. > >> > >> Hmm, I think there is a recursion problem here - if we tell you, then > >> you'd > >> be plagiarizing our solution :) > >> > >> > Expecting your positive reply ...... > >> > >> Seriously though, unless I'm missing something, I can't see how this > would > >> be possible. I suppose you could use techniques such as comparing the > >> number of similar words between articles, but that's not really exact, > and > >> likely to have incorrect results. Plus, you're looking to do this > >> plagiarism check across the whole Internet? > >> > >> Even humans, who read different articles, have a hard time determining > if > >> something is actually plagiarized or not - I think doing it > automatically > >> is > >> nearly impossible. > >> > >> H > >> > >> > >> _______________________________________________ > >> New York PHP Users Group Community Talk Mailing List > >> http://lists.nyphp.org/mailman/listinfo/talk > >> > >> http://www.nyphp.org/Show-Participation > >> > > > > > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://lists.nyphp.org/pipermail/talk/attachments/20100502/53705f9d/attachment-0001.html > > > > ------------------------------ > > Message: 4 > Date: Sun, 2 May 2010 16:43:19 -0400 > From: Justin Dearing <zippy1...@gmail.com> > To: NYPHP Talk <talk@lists.nyphp.org> > Subject: Re: [nyphp-talk] Plagiarism Checker in PHP > Message-ID: > <o2y5458db3c1005021343n25d79d4ex40dd3c00d9034...@mail.gmail.com> > Content-Type: text/plain; charset="iso-8859-1" > > On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <li...@zaunere.com> wrote: > > > > Hi Friends, > > > > Hi, > > > > > Could you please tell / suggest me how to develop Plagiarism Checker > > > feature or send some useful articles / free APIs and so on. > > > > > > Seriously though, unless I'm missing something, I can't see how this > would > > be possible. I suppose you could use techniques such as comparing the > > number of similar words between articles, but that's not really exact, > and > > likely to have incorrect results. Plus, you're looking to do this > > plagiarism check across the whole Internet? > > > > My understanding is many CS professors do this for programming homework. > They are looking for exact matches. Apparently that catches a lot of > people. > > I think chopping up an article into an array of sentences, and throwing a > few into google would be a good approach. Submit 25% of the sentences to > google as exact phrase matches. Throw the first 10 result urls into an > array > for each. Sort the urls and see how many are the same. > > Justin > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: < > http://lists.nyphp.org/pipermail/talk/attachments/20100502/9dda592a/attachment-0001.html > > > > ------------------------------ > > Message: 5 > Date: Sun, 2 May 2010 18:51:49 -0400 > From: "Christopher R. Merlo" <cme...@ncc.edu> > To: NYPHP Talk <talk@lists.nyphp.org> > Subject: Re: [nyphp-talk] Plagiarism Checker in PHP > Message-ID: > <s2s946586481005021551w28bffffepa797335c4e25b...@mail.gmail.com> > Content-Type: text/plain; charset=ISO-8859-1 > > On Sunday, May 2, 2010, Justin Dearing <zippy1...@gmail.com> wrote: > > > > My understanding is many CS professors do this for programming homework. > They are looking for exact matches. Apparently that catches a lot of people. > > Actually, checking for exact matches won't catch the students who > change nothing but identifiers. But there's a really cool free tool > provided by someone at UC Berkeley called Moss that does some sort of > lexical analysis, along with pattern matching, that has helped me > catch lots of unscrupulous programming students. (Yes, lots. The rare > downside of teaching at a community college.) > > > I think chopping up an article into an array of sentences, and throwing a > few into google would be a good approach. > > And that's more or less what I do when I suspect plagiarism in a > written assignment. I don't (yet) do this in any automated way; only > when it seems obvious I should. Don't forget to surround the sentence > with quotes for Google to reduce the amount of false positives. > -c > > > ------------------------------ > > Message: 6 > Date: Mon, 3 May 2010 07:38:02 -0400 > From: "Hall CTR Leam" <leam.hall....@usmc.mil> > To: <talk@lists.nyphp.org> > Subject: [nyphp-talk] Password security for SQLite based pages? > Message-ID: > < > 06a7a82c48a99643b8b77c99946afecb02f1c...@mcusquanez02v.mcdsus.mcds.usmc.mil > > > > Content-Type: text/plain; charset="us-ascii" > > Morning all! > > I'm looking to not use MySQL for some web pages, but to instead try SQLite. > There's nothing in the database that would need to be protected from > reading, and it's a small site that doesn't need a full blown database. > However, I'd like to limit access to updating. Would it be enough just to > have a web page that you had to log in via .htaccess? Is there a better way? > > Thanks! > > Leam > > > -------------------------------------------- > Leam Hall > UNIX Systems Administrator > Contractor for Smartronix ( CMMI Level 3 ISO 9001:2000 FS 91000 ) > Com: 229.639.6028 > Email: leam.hall....@usmc.mil > -------------------------------------------- > -------------- next part -------------- > A non-text attachment was scrubbed... > Name: smime.p7s > Type: application/x-pkcs7-signature > Size: 5206 bytes > Desc: not available > URL: < > http://lists.nyphp.org/pipermail/talk/attachments/20100503/f261fa34/attachment.bin > > > > ------------------------------ > > _______________________________________________ > talk mailing list > talk@lists.nyphp.org > http://lists.nyphp.org/mailman/listinfo/talk > > End of talk Digest, Vol 43, Issue 4 > *********************************** > -- T.R.Parthasarathy +919884117406 “You have to grow from the inside out. None can teach you, none can make you spiritual. There is no other teacher but your own soul.” - Swami Vivekananda
_______________________________________________ New York PHP Users Group Community Talk Mailing List http://lists.nyphp.org/mailman/listinfo/talk http://www.nyphp.org/Show-Participation