Google crawls the web, caches and indexes the pages it finds into a database, and provides a consumer user interface to search that database. What if we could build other applications on top of that same database?

Not having access to Google’s copy of the web, other companies do the same thing as Google themselves in order to provide their services. Some examples:

  • Attributor lets large publishers find video, image, and text copyright infringements.
  • TinEye lets users upload an image and see where it is used online.
  • MajesticSEO lets website owners track backlinks to their pages.

Amazon has a growing list of Public Data Sets. What if they could provide cached “views” of the web that could be processed using EC2 or Elastic MapReduce? That would allow more entrepreneurs to think big about using the whole of the web as a data set.

Amazon’s Public Data Sets already has some cached versions of Wikipedia. Companies (like Freebase) and researchers (such as Jun Liu & Sudha Ram) seem to use them. My wish is we had such a query-able data store for all websites.

Until we have such a data source and platform, here’s Ilya Grigorik’s excellent presentation on Building a Mini-Google in Ruby to do it ourselves.

An aside: For simple site search problems, I prefer the design of Bing’s API over Google’s Site Search. We use Bing for LearnHub Search.

I have worked on projects which have grown quickly through SEO, email marketing, and partnerships. (PoliceLink became the largest law enforcement site on the web 4 months after launch. LearnHub became a top-5 education site in India in one year.) But I’ve never built anything around a viral distribution strategy.
So I look on with marvel at excellent projects like Dropbox which seem to have grown entirely through viral adoption. This is not a fun social facebook app, this is an otherwise boring online storage service. According to TechCrunch in January, Dropbox launched in 2007 and has over 4 million users. They don’t seem to do SEO, SEM, email newsletters, or even much PR. How do they do it?
Dropbox
The product’s core functionality, synching files between computers and the web, works as advertised. The Mac client is really well integrated into the OS. The web interface could be better, but its serviceable. Free accounts come with 2 GB space.
The cool trick, from my perspective, is how they’ve grafted a viral distribution method onto the core service. Its a seamless fit. Here’s the pitch, cut from their UI:
Extra Space
“For every friend who joins Dropbox, we’ll give you both 250 MB of bonus space.”
If you were swayed to sign up for the 2 GB of free storage, then an extra 250 MB for each person you invite is decent free upgrade. It worked on me: I invited a few friends. Why not?
Dropbox seems to be planning to go after the enterprise storage market next. I will be very interested to see if they try a similar viral distribution strategy in that space, or if they have to go the typical enterprise sales route (salespeople, conferences, long sales cycles, support). I don’t think I’ve ever seen online viral distribution for B2B products. Imagine:
“For every company that your company invites that joins Dropbox, we’ll give both companies 50 GB of bonus space.”
That won’t work. Unless their consumer market presence is a big help, and it may, I don’t see them having any natural distribution advantage in the enterprise space. It will be interesting to watch.

« Previous Page



Follow

Get every new post delivered to your Inbox.