Announcing OpenLemmyStats.org: Publicly Queryable Vote History + Other Hidden Data for Any Lemmy User!

booty_flexx@lemmy.world · edit-2 2 years ago

Announcing OpenLemmyStats.org: Publicly Queryable Vote History + Other Hidden Data for Any Lemmy User!

zinklog@lemmy.fmhy.ml · 2 years ago

Was with you until the money point. It’s extremely easy to get this data and there will be many open source versions doing this thing.

But I agree that who upvoted a post shouldn’t be federated.

booty_flexx@lemmy.world · 2 years ago

I totally get what you’re saying.

I think there is (unfortunately) value to be mined from packaging the data conveniently, or offering a subscription service to make it trivial to query for anyone without sysadmin or database skills. Or just throw porn ads on it or some shady ad network that doesn’t mind being placed on questionable sites.

zalack@kbin.social · edit-2 2 years ago

I really think Lemmy, Kbin, and Mastodon need to figure out a way to have a default terms of service that ship with their product which forbids using the API to collect data for commercial purposes.

Additionally, there should be a way for users to indicate licensing for individual posts, with a default license instance admins can set.

That way for-profit instances could be forced to filter out posts with licenses that do not allow for-profit use. Honestly, even just a simple check mark “[ ] allow for-profit republication”, and have two licenses that can be attached: one that allows for-profit use and one that does not.

FaceDeer@kbin.social · 2 years ago

Whoever’s doing this wouldn’t be using Lemmy, Kbin, or Mastodon code. They’d likely write up some custom ActivityPub service that listened in on that protocol. ActivityPub is an open protocol so trying to put some kind of “no profit” restriction on it at this point would be impossible, and having it on there from the start would have killed its adoption.

Lemmy, Kbin, and Mastodon are all currently licensed under the GPL so good luck trying to retroactively put that genie back in the bottle too. The GPL allows for-profit companies to run the code with no further restrictions.

Europe’s got the GDPR, if you really want to try some kind of legal route to counter this, but I don’t think it’s very likely to work well.

OnionFutures@vlemmy.net · 2 years ago

But I agree that who upvoted a post shouldn’t be federated.

This also surprised me. I wonder is it necessary for technical reasons to prevent repeated upvoting of a submission by the same user?

ColonelPanic@lemmy.ml · 2 years ago

I’m pretty sure there is no particular reason why it’s done this way. It’s just the easiest method to coomunicate upvotes across different servers. There are already a lot of ideas for doing it differently or more efficient (e.g. vote aggregation) but that requires a more sophisticated architecture:

Vote aggregation also makes faking votes much more efficient and requires different detection methods. Of course, a spam server can also invent users or votes but it’s a bit more complicated.
Aggregation in any form can be hard to implement because it should be flexible enough to reduce load but not increase delay or make tracking a consistent state even harder. Finding the right configuration will be difficult and go through a lot of trial and error. Should be easier though now that more people are working on the code.
Keep in mind that Lemmy should also be able to communicate with other services across the Fediverse like Mastodon via ActivityPub. I’m not sure if there is something in the standard for message aggregation yet. It’s definitely being discussed because Mastodon, Pixelfed and Peertube all have or went thorugh the same growth problems as Lemmy in terms of scaling, spam and security concerns. If there’s a good solution it will likely come through the AP standard.