NYCPHP Meetup

NYPHP.org

[nycphp-talk] talk Digest, Vol 43, Issue 4

Partha Sarathy parthatr at gmail.com
Mon May 3 08:26:02 EDT 2010


Hi Friends,

Thanks for your replies.

Please have a look of the following sites http://www.copyscpe.com and
http://www.articlechecker.com/.

The above sites are ensured that the posted article is unique and original.

The above sites are search in search engines (like google, yahoo and so on)

I wish to develop the same.

Please let me know the steps how to develop in php / api or so on.



On Mon, May 3, 2010 at 5:08 PM, <talk-request at lists.nyphp.org> wrote:

> Send talk mailing list submissions to
>        talk at lists.nyphp.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
>        http://lists.nyphp.org/mailman/listinfo/talk
> or, via email, send a message with subject or body 'help' to
>        talk-request at lists.nyphp.org
>
> You can reach the person managing the list at
>        talk-owner at lists.nyphp.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of talk digest..."
>
>
> Today's Topics:
>
>   1. Re: Plagiarism Checker in PHP (Hans Zaunere)
>   2. Re: Plagiarism Checker in PHP (Mutaz Musa)
>   3. Re: Plagiarism Checker in PHP (Mutaz Musa)
>   4. Re: Plagiarism Checker in PHP (Justin Dearing)
>   5. Re: Plagiarism Checker in PHP (Christopher R. Merlo)
>   6. Password security for SQLite based pages? (Hall CTR Leam)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sun, 2 May 2010 13:58:31 -0400
> From: "Hans Zaunere" <lists at zaunere.com>
> To: "'NYPHP Talk'" <talk at lists.nyphp.org>
> Subject: Re: [nycphp-talk] Plagiarism Checker in PHP
> Message-ID: <00aa01caea21$138cdee0$3aa69ca0$@com>
> Content-Type: text/plain;       charset="us-ascii"
>
> > Hi Friends,
>
> Hi,
>
> > I am developing blogger website using PHP & MySQL. And, I would like
> > add the feature of Plagiarism Checker.
> >
> > If any one posted an article in my website, that article must be unique
> > and original.
> >
> > Could you please tell / suggest me how to develop Plagiarism Checker
> > feature or send some useful articles / free APIs and so on.
>
> Hmm, I think there is a recursion problem here - if we tell you, then you'd
> be plagiarizing our solution :)
>
> > Expecting your positive reply ......
>
> Seriously though, unless I'm missing something, I can't see how this would
> be possible.  I suppose you could use techniques such as comparing the
> number of similar words between articles, but that's not really exact, and
> likely to have incorrect results.  Plus, you're looking to do this
> plagiarism check across the whole Internet?
>
> Even humans, who read different articles, have a hard time determining if
> something is actually plagiarized or not - I think doing it automatically
> is
> nearly impossible.
>
> H
>
>
>
>
> ------------------------------
>
> Message: 2
> Date: Sun, 2 May 2010 14:40:06 -0400
> From: Mutaz Musa <mutazmusa at gmail.com>
> To: NYPHP Talk <talk at lists.nyphp.org>
> Subject: Re: [nycphp-talk] Plagiarism Checker in PHP
> Message-ID:
>        <q2t602c8ac1005021140udd025bcfwbf371cea82f7678d at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Actually turnitin.com, a website where students turn in schoolwork, checks
> for plagiarism and from what I understand does a pretty good job. They
> compare both student-to-student and student-to-literature. I'm not sure how
> they pull it off, probably a proprietary algorithm that tries to match
> student submissions against their database of content where no match = no
> plagiarism.  It seems like a mammoth task though. Turnitin.com actually
> crawls academic journals, books, etc. to populate their database.
>
> If you're interested only in comparing submissions to your site against one
> another that may be more manageable. If on the other hand you want to
> compare against online/print content in general then that will prove a real
> challenge. Finally, if you have the money maybe you can register with
> turnitin.com - although my impression is that their primarily for
> universities/colleges.
>
> http://turnitin.com/static/index.html
>
> Best,
> Mutaz
>
> On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <lists at zaunere.com> wrote:
>
> > > Hi Friends,
> >
> > Hi,
> >
> > > I am developing blogger website using PHP & MySQL. And, I would like
> > > add the feature of Plagiarism Checker.
> > >
> > > If any one posted an article in my website, that article must be unique
> > > and original.
> > >
> > > Could you please tell / suggest me how to develop Plagiarism Checker
> > > feature or send some useful articles / free APIs and so on.
> >
> > Hmm, I think there is a recursion problem here - if we tell you, then
> you'd
> > be plagiarizing our solution :)
> >
> > > Expecting your positive reply ......
> >
> > Seriously though, unless I'm missing something, I can't see how this
> would
> > be possible.  I suppose you could use techniques such as comparing the
> > number of similar words between articles, but that's not really exact,
> and
> > likely to have incorrect results.  Plus, you're looking to do this
> > plagiarism check across the whole Internet?
> >
> > Even humans, who read different articles, have a hard time determining if
> > something is actually plagiarized or not - I think doing it automatically
> > is
> > nearly impossible.
> >
> > H
> >
> >
> > _______________________________________________
> > New York PHP Users Group Community Talk Mailing List
> > http://lists.nyphp.org/mailman/listinfo/talk
> >
> > http://www.nyphp.org/Show-Participation
> >
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <
> http://lists.nyphp.org/pipermail/talk/attachments/20100502/7cd7f7c2/attachment-0001.html
> >
>
> ------------------------------
>
> Message: 3
> Date: Sun, 2 May 2010 14:42:05 -0400
> From: Mutaz Musa <mutazmusa at gmail.com>
> To: NYPHP Talk <talk at lists.nyphp.org>
> Subject: Re: [nycphp-talk] Plagiarism Checker in PHP
> Message-ID:
>        <w2n602c8ac1005021142yc63e3272ye681533738599a68 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Also, sorry for the double post but check out http://www.ithenticate.com/
>
> On Sun, May 2, 2010 at 2:40 PM, Mutaz Musa <mutazmusa at gmail.com> wrote:
>
> > Actually turnitin.com, a website where students turn in schoolwork,
> checks
> > for plagiarism and from what I understand does a pretty good job. They
> > compare both student-to-student and student-to-literature. I'm not sure
> how
> > they pull it off, probably a proprietary algorithm that tries to match
> > student submissions against their database of content where no match = no
> > plagiarism.  It seems like a mammoth task though. Turnitin.com actually
> > crawls academic journals, books, etc. to populate their database.
> >
> > If you're interested only in comparing submissions to your site against
> one
> > another that may be more manageable. If on the other hand you want to
> > compare against online/print content in general then that will prove a
> real
> > challenge. Finally, if you have the money maybe you can register with
> > turnitin.com - although my impression is that their primarily for
> > universities/colleges.
> >
> > http://turnitin.com/static/index.html
> >
> > Best,
> > Mutaz
> >
> >
> > On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <lists at zaunere.com> wrote:
> >
> >> > Hi Friends,
> >>
> >> Hi,
> >>
> >> > I am developing blogger website using PHP & MySQL. And, I would like
> >> > add the feature of Plagiarism Checker.
> >> >
> >> > If any one posted an article in my website, that article must be
> unique
> >> > and original.
> >> >
> >> > Could you please tell / suggest me how to develop Plagiarism Checker
> >> > feature or send some useful articles / free APIs and so on.
> >>
> >> Hmm, I think there is a recursion problem here - if we tell you, then
> >> you'd
> >> be plagiarizing our solution :)
> >>
> >> > Expecting your positive reply ......
> >>
> >> Seriously though, unless I'm missing something, I can't see how this
> would
> >> be possible.  I suppose you could use techniques such as comparing the
> >> number of similar words between articles, but that's not really exact,
> and
> >> likely to have incorrect results.  Plus, you're looking to do this
> >> plagiarism check across the whole Internet?
> >>
> >> Even humans, who read different articles, have a hard time determining
> if
> >> something is actually plagiarized or not - I think doing it
> automatically
> >> is
> >> nearly impossible.
> >>
> >> H
> >>
> >>
> >> _______________________________________________
> >> New York PHP Users Group Community Talk Mailing List
> >> http://lists.nyphp.org/mailman/listinfo/talk
> >>
> >> http://www.nyphp.org/Show-Participation
> >>
> >
> >
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <
> http://lists.nyphp.org/pipermail/talk/attachments/20100502/53705f9d/attachment-0001.html
> >
>
> ------------------------------
>
> Message: 4
> Date: Sun, 2 May 2010 16:43:19 -0400
> From: Justin Dearing <zippy1981 at gmail.com>
> To: NYPHP Talk <talk at lists.nyphp.org>
> Subject: Re: [nycphp-talk] Plagiarism Checker in PHP
> Message-ID:
>        <o2y5458db3c1005021343n25d79d4ex40dd3c00d9034fb9 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> On Sun, May 2, 2010 at 1:58 PM, Hans Zaunere <lists at zaunere.com> wrote:
>
> > > Hi Friends,
> >
> > Hi,
> >
> > > Could you please tell / suggest me how to develop Plagiarism Checker
> > > feature or send some useful articles / free APIs and so on.
> >
> >
> > Seriously though, unless I'm missing something, I can't see how this
> would
> > be possible.  I suppose you could use techniques such as comparing the
> > number of similar words between articles, but that's not really exact,
> and
> > likely to have incorrect results.  Plus, you're looking to do this
> > plagiarism check across the whole Internet?
> >
> > My understanding is many CS professors do this for programming homework.
> They are looking for exact matches. Apparently that catches a lot of
> people.
>
> I think chopping up an article into an array of sentences, and throwing a
> few into google would be a good approach. Submit 25% of the sentences to
> google as exact phrase matches. Throw the first 10 result urls into an
> array
> for each. Sort the urls and see how many are the same.
>
> Justin
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL: <
> http://lists.nyphp.org/pipermail/talk/attachments/20100502/9dda592a/attachment-0001.html
> >
>
> ------------------------------
>
> Message: 5
> Date: Sun, 2 May 2010 18:51:49 -0400
> From: "Christopher R. Merlo" <cmerlo at ncc.edu>
> To: NYPHP Talk <talk at lists.nyphp.org>
> Subject: Re: [nycphp-talk] Plagiarism Checker in PHP
> Message-ID:
>        <s2s946586481005021551w28bffffepa797335c4e25b839 at mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> On Sunday, May 2, 2010, Justin Dearing <zippy1981 at gmail.com> wrote:
> >
> > My understanding is many CS professors do this for programming homework.
> They are looking for exact matches. Apparently that catches a lot of people.
>
> Actually, checking for exact matches won't catch the students who
> change nothing but identifiers. But there's a really cool free tool
> provided by someone at UC Berkeley called Moss that does some sort of
> lexical analysis, along with pattern matching, that has helped me
> catch lots of unscrupulous programming students. (Yes, lots. The rare
> downside of teaching at a community college.)
>
> > I think chopping up an article into an array of sentences, and throwing a
> few into google would be a good approach.
>
> And that's more or less what I do when I suspect plagiarism in a
> written assignment.  I don't (yet) do this in any automated way; only
> when it seems obvious I should. Don't forget to surround the sentence
> with quotes for Google to reduce the amount of false positives.
> -c
>
>
> ------------------------------
>
> Message: 6
> Date: Mon, 3 May 2010 07:38:02 -0400
> From: "Hall CTR Leam" <leam.hall.ctr at usmc.mil>
> To: <talk at lists.nyphp.org>
> Subject: [nycphp-talk] Password security for SQLite based pages?
> Message-ID:
>        <
> 06A7A82C48A99643B8B77C99946AFECB02F1C955 at mcusquanez02v.mcdsus.mcds.usmc.mil
> >
>
> Content-Type: text/plain; charset="us-ascii"
>
> Morning all!
>
> I'm looking to not use MySQL for some web pages, but to instead try SQLite.
> There's nothing in the database that would need to be protected from
> reading, and it's a small site that doesn't need a full blown database.
> However, I'd like to limit access to updating. Would it be enough just to
> have a web page that you had to log in via .htaccess? Is there a better way?
>
> Thanks!
>
> Leam
>
>
> --------------------------------------------
> Leam Hall
> UNIX Systems Administrator
> Contractor for Smartronix  ( CMMI Level 3 ISO 9001:2000 FS 91000 )
> Com:   229.639.6028
> Email: Leam.Hall.ctr at usmc.mil
> --------------------------------------------
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: smime.p7s
> Type: application/x-pkcs7-signature
> Size: 5206 bytes
> Desc: not available
> URL: <
> http://lists.nyphp.org/pipermail/talk/attachments/20100503/f261fa34/attachment.bin
> >
>
> ------------------------------
>
> _______________________________________________
> talk mailing list
> talk at lists.nyphp.org
> http://lists.nyphp.org/mailman/listinfo/talk
>
> End of talk Digest, Vol 43, Issue 4
> ***********************************
>



-- 
T.R.Parthasarathy
+919884117406

“You have to grow from the inside out. None can teach you, none can make you
spiritual. There is no other teacher but your own soul.” - Swami Vivekananda
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.nyphp.org/pipermail/talk/attachments/20100503/20ffce90/attachment.html>


More information about the talk mailing list