On top of that, I personally set my default youtube app to be newpipe, so if I'm just casually browsing lemmy or elsewhere I can click a youtube link and figure out what it is, even watch it if I want, without it effecting my normal account and watch history.
If I like the video and want more like it, I'll share to the Youtube (Revanced) app and finish it there.
Then you have to create a framework for evaluating the effect of the addition of each source into "positive" or "negative". Good luck with that. They can't even map input objects in the training data to their actual source correctly or consistently.
It's absolutely possible, but pretty much anything that adds more overhead per each individual input in the training data is going to be too costly for any of them to try and pursue.
O(n) isn't bad, but when your n is as absurdly big as the training corpuses these things use, that has big effects. And there's no telling if it would actually only be an O(n) cost.