-
Notifications
You must be signed in to change notification settings - Fork 308
roll our own automatic fraud prevention #360
Comments
|
From @sigmavirus24 on #354:
|
I assume you meant here instead of #60 ;). But yeah based off of @dangerdave's blog post that you posted elsewhere, I would say just store the IP addresses if you're so inclined. I'm not a fan in general, but I use other services that store data I object to them keeping. And like I said, it's really a simple API call. You might want to upgrade the version of requests you're using so that you can just use the Like I said accessing the data is pretty simple, it would be a simple For their number of forks, you would have to unfortunately iterate over And finally events would only need one request (or as many as you'd like). You could gauge dates and whatnot. Then again if you see everything is from one day you can't draw any conclusions from that. Why? Because you have some uber active users who's entire first 30+ events were done that day. Events include issue comments, code comments (review comments), issue comments, repo creation, following a user, starring a repository, etc. You already have the repo creation dates from iterating over the repositories they have so that wouldn't be of interest. The others don't imply anything other than they know how to use an API to create random comments on each other's repositories. I guess just checking for the existence of an event would be ok. This is easy to game though and should have a very low weight. This on it's own though is (as I explain more of how it would work) insufficient. If they had a large enough number of accounts here, they could easily follow each other which will artificially boost their followers/following number. They could create their own repos and fork each other's but then you'd see that that was all done on the same day so that would take more work/time/planning on their part. So this could certainly be used in conjunction with something else, but I'm not sure what. Another thing that would be easy to game (but only vein humans do) is set their avatar_url and their real name. Again, this would not work especially since I only recently added my real name to my GitHub account and there may exist users who were just too lazy to do so. |
@shawndavenport for any accounts that are detected/suspected to be fraudulent, how would @whit537 go about reporting them to you guys? Perhaps automate an email to [email protected]? |
We could talk to the guys at Work for Pie on how to traverse and score the Github API, or perhaps even integrate their score (or a portion thereof). From a bigger picture though, Gittip isn't limited to developers. No matter how fancy we get in processing info from Github, we're not going to be able to use it to score someone who is, for instance, primarily a political activist/journalist. |
Their scoring takes a short while and mainly goes over the repos to use stargazers and forks as a means of scoring your code skills. And yeah, I never claimed this would be good for all GitTippers, only those with GitHub accounts. Considering you can register via Twitter, this would fail for those users. And since it seems like there is going to be a modicum of human review, activists and journalists would be caught. My scoring is just to give @whit537 a way to tackle the seemingly most/least suspicious users first, instead of having just going user to user. |
@sigmavirus24, hopefully the volume will be low, so for the time bring feel free to email me directly. |
Awesome. Thanks @shawndavenport |
Closing as stale. |
ht @exratione on their blog via hn
Note on that HN thread the suggestion, that by hashing the IP address we can forestall privacy concerns (#345).
The text was updated successfully, but these errors were encountered: