The question came up in conversation today of whether there are other Ruby on Rails sites bigger than Twitter. The answer is yes.
I get a lot of mileage out of ripping on Rails. Besides being easy, it’s also quite fun, and I’m always game for a cheap laugh. (I’ll continue to get some more blog mileage out of it this summer.) But it is important to remember, next time you go Fail Whaling, that there are other Rails sites out there, some of which you’ve heard of and use.
Some of them even run.
If you want to read this graph correctly, it basically says that Scribd is easily the largest Rails site on the internet, and there are at least three others with traffic parity with Twitter.
Scribd hosts a presentation they made last year on how they scale Rails. It’s worth a look, though I’m curious what their numbers are today, and what a “request” is. The need (at the time) for only a single web server had to do with the mostly-static/segmented nature of their problem area. Remember: many CMS-style news sites reached very large traffic levels before memcache even existed. I’m also very curious why they’re bound by disk IO on their app server. The section about analytics is very humorous in its naïveté.
In spite of the caveat of the problem space, it’s a pretty good job to do all that with three people. It reminds me of when I was shocked to find out Cal built Flickr on his laptop (buy his book; see my review).
YellowPages.com made a high-profile switch from J2EE to Rails last year. The cynical me notes that the only thing that gets beat on more than Rails in my universe is J2EE. It’s impressive that they handle 23 million visitors a month, which puts them around half a Tagged by dead reckoning. Of course, if you are counting dynamic page views, we do more in a day than they do in a month. So I guess it depends on how you count. Another cynical thought is the question of why, with improvements like memcache, which came out after YellowPages was first built, they still had to write some binary-level code for Ruby and increase the server count.
But I’ve always said re-architecture midstream is one of the hardest things to do, so the team there deserves props, because they managed to change both the architecture and the app platform midstream with nary a hitch.
I love Hulu. They’ve had phenomenal growth that is very deserved.
Of course, the cynical me notes that it’s no YouTube in traffic or even in community structure. And they managed to migrate from PHP to a pure Python implementation fairly easily. It makes me think that there really isn’t much of a challenge in building a mostly static, high-bandwidth content delivery website that gets huge traffic on any platform. Yeah, I’m looking at you, FunnyOrDie…and Scribd…and Yellow Pages. 😉
Still, I love Hulu, if only to get my BSG fix. You should join.
One Friday last year, I cut across Washington Square in order to drop off a letter on my way to work. I passed by a group of people starting a BBQ and stared at one of them, who stared back at me. When I got in to work, before we headed to our 50 millionth user party, I asked a co-worker, “Hey, that guy you mentioned who films himself 24/7: is he Asian, with a backpack and a hat on his head with a camera on it?”
“Yeah, I think so.”
“Then, I think I saw him on my way to work.”
That’s the first time I met Justin of Justin.TV.
They’ve since moved to posher digs from the hellhole in North Beach they used to inhabit, and their traffic is also respectably within the realm of Twitter.
Another reason that they deserve mention is that, while they are one of those high-bandwidth content-driven sites, Justin.TV does stand out from the others above in two key ways: live-streaming and a large social networking component. I am mildly curious about how much of the live-stream and live chat architecture is powered by twitter (though not enough to bother asking at a party; besides, I have a feeling I already know that answer).
But it does go to show you that the lines between “content management” and “social networking,” like the lines between “data driven” and “user driven,” are being blurred.
Or maybe it’s that social networking, like content-management, is becoming old hat, that now the solved problems are solved.
And that means, it soon won’t matter what language you choose or architecture you use.
Anyone know of any good memcache bindings for HOMESPRING? 🙂