I have a list of communities, each with total votes, upvote percentage, and the community name. I want to sort the list by ‘engagement,’ which would be some combination of total votes and upvote percentage. What is the best way to do this? What would be the best measure of ‘engagement’ with each community given this data?

  • it_depends_man@lemmy.world
    link
    fedilink
    arrow-up
    1
    ·
    26 days ago

    You want some kind of decay function for when that engagement happened.

    The rest is sort of up to you and depends on your math intuition a bit. If you do something like total (votes/10.000)+% relative stuff will weigh heavily until you get close to 10.000 then the votes will dominate no matter how positive the post was. But the 10.000 is arbitrary.

    My advice would be to create some fake data that are plausible scenarios, (well liked, low vote), (lots of votes, medium %), (lots of votes, but old) and then you experiment with some functions and curves until you find a mix you like.