I (Mikaey) have written a bot (which I call "AarghBot") to compile a list of potential cut-and-paste moves. How do I have a bot determine whether or not a cut-and-paste move has occurred? Here's how:
In these lists, you will see a diff score for each entry. The diff score is computed as the number of lines that changed between the pretext and the posttext, divided by the total number of lines in the pretext. It is designed to be a measure of uncertainty that the two articles in question are based on the same text: a higher diff score means it is less likely that the two articles are based on the same text, while a lower diff score means it is more likely. A diff score of zero indicates that the two texts are identical, apart from differences in whitespace and casing. Note that, when the diff is performed, empty lines are stripped from both texts, and the comparison is case-insensitive and whitespace-insensitive.

To admins who work on this list: Please feel free to remove any items from the list that you take care of.

To anyone who works on this list: If you come across a false positive, tag the source page with {{nahmc|<destination page>}}, where <destination page> is the page in the destination column of the report. This will cause the bot to ignore that particular match on the next run. ("nahmc" = "Not A HistMerge Candidate".)

The Lists

Each list contains 500 items.
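For readers who want to reproduce a diff score by hand, the calculation described earlier on this page might be sketched roughly as follows. This is not AarghBot's actual code. The function names `normalize` and `diff_score` are made up for illustration, and two details are assumptions: "whitespace-insensitive" is taken to mean that all whitespace inside a line is ignored, and a block of replaced lines is counted as the larger of its two sides.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Drop empty lines; fold case and remove all whitespace from the rest.

    Assumption: "whitespace-insensitive" means whitespace is ignored entirely.
    """
    lines = []
    for raw in text.splitlines():
        key = "".join(raw.split()).casefold()
        if key:  # skip lines that are empty or whitespace-only
            lines.append(key)
    return lines


def diff_score(pretext, posttext):
    """Changed lines divided by the number of (non-empty) pretext lines."""
    pre = normalize(pretext)
    post = normalize(posttext)
    if not pre:
        raise ValueError("pretext has no non-empty lines")

    matcher = SequenceMatcher(None, pre, post, autojunk=False)
    changed = 0
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            # Assumption: a replaced/inserted/deleted block counts as the
            # larger of its pretext and posttext sides.
            changed += max(i2 - i1, j2 - j1)
    return changed / len(pre)
```

Under these assumptions, two texts that differ only in blank lines, casing, or spacing score zero, and changing one line out of four gives a score of 0.25.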