Submitted by John-The-Bomb-2 t3_127wy7i in MachineLearning
News just released via this Tweet.
Source code here: https://github.com/twitter/the-algorithm
I just listened to Elon Musk and Twitter Engineering talk about it on this Twitter space.
Submitted by John-The-Bomb-2 t3_127wy7i in MachineLearning
News just released via this Tweet.
Source code here: https://github.com/twitter/the-algorithm
I just listened to Elon Musk and Twitter Engineering talk about it on this Twitter space.
Yeah, like Elon or not, the push for open source is always going to be beneficial to the community. Ironic how twitter is more open than ____AI.
Twitter is already established as a brand to near saturation and Elon has more money than god. It's the perfect combo for ML philanthropy. Now waiting for that Tesla vision algorithm...
Most infrastructure code like computer vision code, device drivers, etc are either not culturally relevant or have little cultural relevance.
I don't think it makes any sense to prioritize them when things like twitter have much more direct cultural impact. It would be great if my network card driver was open source, but does it really matter? Is it worth prioritizing? Will it likely have any cultural relevance? To most people the answer to all these questions is no.
Elon didn’t open source it.
Then who did, if not the owner/CEO
> world-class complex recommendation & ranking system
https://twitter.com/amasad/status/1641879976529248256?s=20
I mean surely it is great but my recommendations weren't exactly stellar in those years.
this aprt is not used for recommendations though. this is for analytics and internal testing and ensuring that different groups (+elon) don't get disadvantaged.
I wonder did they add that flag before or after the day when they accidentally made people see only Elon's tweets on their timeline: https://www.theverge.com/2023/2/13/23598514/twitter-algorithm-elon-musk-tweets
Parts of this code dump are for recommendations and ranking.
Idk man as a fairly well seasoned MLE I find their general architecture and scale of their combined models to be fascinating in-and-of itself.
Twitter sucks ass - but this is a beautiful piece of ML Engineering.
Where to start with, it’s such a huge project 😳
Readme.md
Sorry, had to 🤓
I checked the commit history but it has only one commit. So this is a pretty straight forward place to start: https://github.com/twitter/the-algorithm/commit/7f90d0ca342b928b479b512ec51ac2c3821f5922
So how do we know this is the repo used by Twitter and not some stripped down version of it
This quite obviously isn't the repo used by twitter.
It is a pretty large and well put together documentation epic & consolidation of multiple microservices.
Whether the content is 100% reflective of whats deployed is completely unclear. But its not "fake" that's for sure, its genuinely too many man-years of work to not be in-essence real.
It will take time, but I'd imagine it should be possible to derive a method of determining this by observation.
Algorithms like this will have fingerprints.
We don't and likely we won't know.
Unless perhaps someone internal checks and leaks important missing details that later on...
But for now, it does seem robust enough to be reflective of what they have probably been using up to some recent - but that's still just speculation
> Sorry, had to
Well, your reply was much more polite than the old "RTFM!"
> Here we have a world-class complex recommendation
...You know this is twitter's recommender system right? All the tweets I interact with are ML related from very 'left' people like Jeremy Howard.
My recommender system could legit be:
if interested_in_finance_or_ML:
recommend_alt_right_hate_speech_accounts()
recommend_crypto_scam_ads()
Get rid of the if statement and you just recreated Twitter's recommendation algorithm
In fairness, it doesn't really matter what you interact with. Twitter is just a sewer of alt-right hate speech for everyone.
It's a pretty cool resource to get to look at an enterprise recommendation algorithm like that.
​
An aside, if you want a chuckle, search the term "Elon" in the repo:https://github.com/twitter/the-algorithm/search?q=elonhttps://github.com/twitter/the-algorithm/search?q=elon&type=issues
​
[edit 1]
since it's gone now, here's the back up provided by u/MjrK:https://i.imgur.com/jxqaByA.png
[edit 2] lol
https://github.com/twitter/the-algorithm/commit/ec83d01dcaebf369444d75ed04b3625a0a645eb9#diff-a58270fa1b8b745cd0bd311bed9cd24c983de80f96e7bd445e16e88b61e492b8L225
>An aside, if you want a chuckle, search the term "Elon" in the repo:https://github.com/twitter/the-algorithm/search?q=elon
However, it does not seem to affect the recommendation algorithm.
I think we can safely go with Occam's Razor here. I would assume the "influential celebrity" is the "power_user" type, see: https://i.imgur.com/s6ntUil.png
​
Either way, I'm not surprised they are giving tweets from Musk their own type. Why wouldn't they. It probably became necessary to deal with his antics.
It's kinda nice to see PageRank is still being used as one of the components of the algorithm
Apparently there is an Elon feature as well as for Republicans and Democrats?
https://github.com/twitter/the-algorithm/blob/7f90d0ca342b928b479b512ec51ac2c3821f5922/home-mixer/server/src/main/scala/com/twitter/home_mixer/functional_component/decorator/HomeTweetTypePredicates.scala#L228
Well devil is in the detail, don't miss the fun part in commit messages :)
Please note we have force-pushed a new initial commit in order to remove some publicly-available Twitter user information. Note that this process may be required in the future.
I'm still able to access this link though, even on private browser
He can’t accept that nobody wants him on their timeline
Wonder whether they included the Elon+1000 and Can'tBlockHim mods in this version?
As far as I know, there was never any evidence to back up those claims
The claims are plausible accidents from a technical perspective. It's very possible for a system which does blocklists to choke up on the longest Blocklist it has ever seen and fail to add new things to the list.
[removed]
And the ElonAutoUnblocker
If it's anything like their algorithm that shows me the tweets from a trending, I wouldn't want it.
There is too much money at stake for there not to be additional invisible weights that are able to be tweaked by Twitter behind the scenes.
For example, I would imagine a 2 billion dollar stake by the Saudi's would purchase huge influence. This goes for anyone else that Elon "hangs" with during the Olympics or the Superbowl, or FIFA WorldCup.
Those are probably part of the advertisement system.
There is no way this is what Twitter is using.
[deleted]
We get it, space man bad but it’s a for profit company. Nobody was expecting 100% of the code. How much did you pay for the self driving bridge?
I don't take any CEO's words at face value without considering the monetary values and incentives behind that tongue.
A large project like this being open-sourced, even if it's a very old or heavily stripped down version, is always a great thing for the community.
[deleted]
[removed]
ZestyData t1_jegdmzo wrote
Putting aside the political undertones behind many peoples' desire to publish "the algorithm", this is a phenomenal piece of educational content for ML professionals.
Here we have a world-class complex recommendation & ranking system laid bare for all to read into, and develop upon. This is a veritable gold mine of an an educational resource.