this post was submitted on 07 Dec 2025
394 points (99.0% liked)

Science Memes

17598 readers
1887 users here now

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.



Rules

  1. Don't throw mud. Behave like an intellectual and remember the human.
  2. Keep it rooted (on topic).
  3. No spam.
  4. Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.



Research Committee

Other Mander Communities

Science and Research

Biology and Life Sciences

Physical Sciences

Humanities and Social Sciences

Practical and Applied Sciences

Memes

Miscellaneous

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] Th4tGuyII@fedia.io 28 points 19 hours ago (2 children)

Genuinely. As a student I don't think I ever saw a Turnitin score for my work below 40%. There are only so many ways to wrute a sentence about the same thing, so its impossible to not accidentally plagiarise someone's works.

I remember one lecturer telling me that they don't really look at the % unless its something aggregious like +70%. But more often they're looking for patterns in what it highlights.

Loads of tiny highlights with individual sources are likely to be a false positive, but big chunks of highlights from only a couple of sources is likely to be a true positive.

[–] fonix232@fedia.io 23 points 17 hours ago

Plagiarism should only ever be counted for explicitly unique sentences that provide actual value.

It's actually an ongoing debate in software engineering, due to licensing, as to what you can consider "stolen code" - i.e. plagiarism.

In fact things went as far as to some companies employing AI-aided automatic cease-and-desist deliveries on GitHub, but the system was so badly configured, it detected even the most basic logic bits as license infringement. Things that are standardised in software development - like, for example, for loops, that happened to have generic parameter names (e.g if you were to create a graphic subsystem for displaying Views, whatever the primary implementation may be, you'd iterate through all views with a for loop, making it a generic call such as for(val view in views) { [do something here] }).

Well this AI aided detector was so brilliant that it detected such minute coincidences of codebases as legitimate violations (as if any company could copyright generics), and sent these spurious C&Ds to dozens of git repos. What's even worse is that the initial company's codebase used some open source libraries that were directly attacked... for being 100% copies of their own codebase.

IMO as long as the code/sentence isn't a provably unique statement, plagiarism shouldn't apply. A whole paragraph having 80%+ similarity to something unique? Now that's worrying enough to investigate.

[–] fossilesque@mander.xyz 6 points 17 hours ago (1 children)
[–] Th4tGuyII@fedia.io 14 points 16 hours ago (1 children)

Fair enough. I imagine as a PhD its easier to avoid since you're doing new research, so you're presenting unique information with (in theory) unique sentences.

Whereas for a lot of undergrad students, up until the tail end of their degrees, they're writing about fairly extensively covered topics, so you're much more likely to accidentally steal wordings from others who have already written about them. In fact at that stage, I'd bet having too low a plagiarism score would more likely indicate you're barking up the wrong tree.

[–] fossilesque@mander.xyz 5 points 16 hours ago