The Encyclopedia of Life

Wow! This is interesting. Wikipedia for biologists: inspired by E.O. Wilson. “Imagine”, he says, “an electronic page for each species of organism on Earth, available everywhere by single access on command.”

Posted in Web

Google pokes a sharp stick in Microsoft’s eye

You may have seen the news that Google is launching its own (open source) browser, codenamed Chrome. According to the company blog,

Under the hood, we were able to build the foundation of a browser that runs today’s complex web applications much better. By keeping each tab in an isolated “sandbox”, we were able to prevent one tab from crashing another and provide improved protection from rogue sites. We improved speed and responsiveness across the board. We also built a more powerful JavaScript engine, V8, to power the next generation of web applications that aren’t even possible in today’s browsers.

In reality, this takes us back to the original threat/promise of Netscape — the thing that threatened Microsoft so much that it set out to destroy Netscape. This was the idea that the browser was destined to become the key piece of software — almost an operating system in its own right.

Google Chrome takes up that idea, and holds out the promise of making it a reality. As Nick Carr puts it, Chrome

promises a similar leap in the capacity of the cloud to run applications speedily, securely, and simultaneously. Indeed, it is the first browser built from the ground up with the idea of running applications rather than displaying pages. It takes the browser’s file-tab metaphor, a metaphor reflecting the old idea of the web as a collection of pages, and repurposes it for application multitasking. Chrome is the first cloud browser.

See the exposition in Google’s Comic Book for an outline of the thinking that went into Chrome. It’s basically the first multi-threaded browser.

This is an important strategic move by Google. To quote Carr again,

Google is motivated by something much larger than its congenital hatred of Microsoft. It knows that its future, both as a business and as an idea (and Google’s always been both), hinges on the continued rapid expansion of the usefulness of the Internet, which in turn hinges on the continued rapid expansion of the capabilities of web apps, which in turn hinges on rapid improvements in the workings of web browsers.

To Google, the browser has become a weak link in the cloud system – the needle’s eye through which the outputs of the company’s massive data centers usually have to pass to reach the user – and as a result the browser has to be rethought, revamped, retooled, modernized…

I’ve no doubt that this development will be presented in the mainstream media as Google’s “attempt to capture the browser market”. That would be a misconception IMHO. By making Chrome open source Google is ensuring that any browser that seeks to stay competitive has to take up the multi-threading idea. Which will make cloud computing even more pervasive. Which will further increase Google’s importance. As a strategy, it’s fiendishly clever.

And just in case the folks in Cupertino are sniggering, this is a harbinger of things to come on the mobile phone front too. Google has sussed that the (closed) iPhone will be difficult to beat, so its attack is based on an open platform (Android). Smart.

Many thanks to Gerard for the original link (even though he hates the Comic Book!)

LATER: I can’t run Chrome because the first beta release only runs under Windows Vista (if you please), but TechCrunch has been using it and likes it a lot.

STILL LATER: Kate Greene has a useful overview in Tech Review. And the Register published a perceptive piece by Tim Anderson.

Internet memes

Michael has found a timeline of memes. Hooray! Now, instead of having to embark on long explanations to sceptical interrogators, I can simply point them to it.

What’s a meme? Wikipedia says the term “denotes any learned feeling, thought or behavior. Examples include thoughts, ideas, theories, practices, habits, songs, dances and moods. Memes propagate themselves and can move through a sociological ‘culture’ in a manner similar to the behavior of a virus”.

I prefer to say that a meme is an infectious idea.

How big is the web?

Nobody really knows, but here is an interesting post on the Official Google Blog…

We’ve known it for a long time: the web is big. The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, we’ve seen a lot of big numbers about how much content is really out there. Recently, even our search engineers stopped in awe about just how big the web is these days — when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!

How do we find all those pages? We start at a set of well-connected initial pages and follow each of their links to new pages. Then we follow the links on those new pages to even more pages and so on, until we have a huge list of links. In fact, we found even more than 1 trillion individual links, but not all of them lead to unique web pages. Many pages have multiple URLs with exactly the same content or URLs that are auto-generated copies of each other. Even after removing those exact duplicates, we saw a trillion unique URLs, and the number of individual web pages out there is growing by several billion pages per day.

So how many unique pages does the web really contain? We don’t know; we don’t have time to look at them all! :-) Strictly speaking, the number of pages out there is infinite — for example, web calendars may have a “next day” link, and we could follow that link forever, each time finding a “new” page. We’re not doing that, obviously, since there would be little benefit to you. But this example shows that the size of the web really depends on your definition of what’s a useful page, and there is no exact answer…

Sauce for the gander…

If you’ve been following Associated Press’s absurd attempt to rewrite the law on Fair Use then you’ll enjoy Michael Arrington’s latest sally.

As far as I can tell, the Associated Press is sticking by its ridiculous and unlawful assertion that “direct quotations, even short ones” are copyright infringements and result in lawsuit threats and DMCA takedown notices.

This story led us to ban the A.P., call the New York Times out on undisclosed conflicts of interest and begin to investigate some ridiculous organization called the Media Bloggers Association before getting bored and wandering off to other topics.

But now the A.P. has gone too far. They’ve quoted twenty-two words from one of our posts, in clear violation of their warped interpretation of copyright law. The offending quote, from this post, is here (I’m suspending my A.P. ban to report on this important story).

Am I being ridiculous? Absolutely. But the point is to illustrate that the A.P. is taking an absurd and indefensible position, too. So I’ve called my lawyers (really) and have asked them to deliver a DMCA takedown demand to the A.P. And I will also be sending them a bill for $12.50 with that letter, which is exactly what the A.P. would have charged me if I published a 22 word quote from one of their articles.

Next time, A.P., ask permission before you quote me. I work hard to create content, and it just isn’t appropriate for you to simply cut and paste it into your own product and then sell that to others.

The Net and the 2008 US Election

Very interesting study from the Pew ‘Internet and American Life’ project which, among many other things, suggests that,

A significant number of voters are also using the internet to gain access to campaign events and primary documents. Some 39% of online Americans have used the internet to access “unfiltered” campaign materials, which includes video of candidate debates, speeches and announcements, as well as position papers and speech transcripts…

Paul Krugman: we’ll all be Grateful Dead one day

From his NYT column

In 1994,… Esther Dyson, made a striking prediction: that the ease with which digital content can be copied and disseminated would eventually force businesses to sell the results of creative activity cheaply, or even give it away. Whatever the product — software, books, music, movies — the cost of creation would have to be recouped indirectly: businesses would have to “distribute intellectual property free in order to sell services and relationships.”

For example, she described how some software companies gave their product away but earned fees for installation and servicing. But her most compelling illustration of how you can make money by giving stuff away was that of the Grateful Dead, who encouraged people to tape live performances because “enough of the people who copy and listen to Grateful Dead tapes end up paying for hats, T-shirts and performance tickets. In the new era, the ancillary market is the market.”

Indeed, it turns out that the Dead were business pioneers. Rolling Stone recently published an article titled “Rock’s New Economy: Making Money When CDs Don’t Sell.” Downloads are steadily undermining record sales — but today’s rock bands, the magazine reports, are finding other sources of income. Even if record sales are modest, bands can convert airplay and YouTube views into financial success indirectly, making money through “publishing, touring, merchandising and licensing.”

What other creative activities will become mainly ways to promote side businesses? How about writing books?

He goes on to argue that — via Kindle-type devices — much the same will happen to book authoring. Hmmm…

Adaptive websites

We’ve become accustomed to websites which affect to ‘personalise’ pages based on the use of cookies. But here’s an interesting piece in Technology Review about technology developed at MIT’s Sloan School of Management which adapts to unknown users within the first few clicks on the website by analyzing each user’s pattern of clicks.

John Hauser, a professor of marketing at the Sloan School and the lead author of a paper on the research that is slated to appear in Marketing Science, explains that a website running the system would detect a user’s cognitive style. It would watch for traits, such as whether or not the user is detail oriented, and morph to complement that style. The changes would be subtle. “Suddenly, you’re finding the website is easy to navigate, more comfortable, and it gives you the information you need,” Hauser says. The user, he says, shouldn’t even realize that the website is personalized.

The researchers built a prototype website for British Telecom, set up to sell broadband plans. The website is designed so that the first few clicks that visitors make are likely to reveal aspects of cognitive style. For example, the initial page that a user sees lets her choose, among other things, to compare plans using a chart or to interact with a broadband advisor. “You can see that someone who’s very analytic is probably more likely to go to ‘compare plans’ than to the direct advisor,” says Hauser. Within about 10 clicks, the system makes a guess at the user’s cognitive style and morphs to fit. “If we determine that you like lots of graphs, you’re going to start seeing lots of graphs,” he says. “If we determine that you like to get advice from peers, you’re going to see lots of advice from peers.”

In addition to guessing at each user’s cognitive style by analyzing that person’s pattern of clicks, the system would track data over time to see which versions of the website work most effectively for which cognitive styles.

Posted in Web