Archive for the 'Web' Category

Internet memes

[link] Tuesday, August 5th, 2008

Michael has found a timeline of memes. Hooray! Now, instead of having to embark on long explanations to sceptical interrogators, I can simply point them to it.

What’s a meme? Wikipedia says the term “denotes any learned feeling, thought or behavior. Examples include thoughts, ideas, theories, practices, habits, songs, dances and moods. Memes propagate themselves and can move through a sociological ‘culture’ in a manner similar to the behavior of a virus”.

I prefer to say that a meme is an infectious idea.

How big is the web?

[link] Saturday, July 26th, 2008

Nobody really knows, but here is an interesting post on the Official Google Blog…

We’ve known it for a long time: the web is big. The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, we’ve seen a lot of big numbers about how much content is really out there. Recently, even our search engineers stopped in awe about just how big the web is these days — when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!

How do we find all those pages? We start at a set of well-connected initial pages and follow each of their links to new pages. Then we follow the links on those new pages to even more pages and so on, until we have a huge list of links. In fact, we found even more than 1 trillion individual links, but not all of them lead to unique web pages. Many pages have multiple URLs with exactly the same content or URLs that are auto-generated copies of each other. Even after removing those exact duplicates, we saw a trillion unique URLs, and the number of individual web pages out there is growing by several billion pages per day.

So how many unique pages does the web really contain? We don’t know; we don’t have time to look at them all! :-) Strictly speaking, the number of pages out there is infinite — for example, web calendars may have a “next day” link, and we could follow that link forever, each time finding a “new” page. We’re not doing that, obviously, since there would be little benefit to you. But this example shows that the size of the web really depends on your definition of what’s a useful page, and there is no exact answer…

Sauce for the gander…

[link] Saturday, June 21st, 2008

If you’ve been following Associated Press’s absurd attempt to rewrite the law on Fair Use then you’ll enjoy Michael Arrington’s latest sally.

As far as I can tell, the Associated Press is sticking by its ridiculous and unlawful assertion that “direct quotations, even short ones” are copyright infringements and result in lawsuit threats and DMCA takedown notices.

This story led us to ban the A.P., call the New York Times out on undisclosed conflicts of interest and begin to investigate some ridiculous organization called the Media Bloggers Association before getting bored and wandering off to other topics.

But now the A.P. has gone too far. They’ve quoted twenty-two words from one of our posts, in clear violation of their warped interpretation of copyright law. The offending quote, from this post, is here (I’m suspending my A.P. ban to report on this important story).

Am I being ridiculous? Absolutely. But the point is to illustrate that the A.P. is taking an absurd and indefensible position, too. So I’ve called my lawyers (really) and have asked them to deliver a DMCA takedown demand to the A.P. And I will also be sending them a bill for $12.50 with that letter, which is exactly what the A.P. would have charged me if I published a 22 word quote from one of their articles.

Next time, A.P., ask permission before you quote me. I work hard to create content, and it just isn’t appropriate for you to simply cut and paste it into your own product and then sell that to others.

The Net and the 2008 US Election

[link] Tuesday, June 17th, 2008

Very interesting study from the Pew ‘Internet and American Life’ project which, among many other things, suggests that,

A significant number of voters are also using the internet to gain access to campaign events and primary documents. Some 39% of online Americans have used the internet to access “unfiltered” campaign materials, which includes video of candidate debates, speeches and announcements, as well as position papers and speech transcripts…

Paul Krugman: we’ll all be Grateful Dead one day

[link] Monday, June 9th, 2008

From his NYT column

In 1994,… Esther Dyson, made a striking prediction: that the ease with which digital content can be copied and disseminated would eventually force businesses to sell the results of creative activity cheaply, or even give it away. Whatever the product — software, books, music, movies — the cost of creation would have to be recouped indirectly: businesses would have to “distribute intellectual property free in order to sell services and relationships.”

For example, she described how some software companies gave their product away but earned fees for installation and servicing. But her most compelling illustration of how you can make money by giving stuff away was that of the Grateful Dead, who encouraged people to tape live performances because “enough of the people who copy and listen to Grateful Dead tapes end up paying for hats, T-shirts and performance tickets. In the new era, the ancillary market is the market.”

Indeed, it turns out that the Dead were business pioneers. Rolling Stone recently published an article titled “Rock’s New Economy: Making Money When CDs Don’t Sell.” Downloads are steadily undermining record sales — but today’s rock bands, the magazine reports, are finding other sources of income. Even if record sales are modest, bands can convert airplay and YouTube views into financial success indirectly, making money through “publishing, touring, merchandising and licensing.”

What other creative activities will become mainly ways to promote side businesses? How about writing books?

He goes on to argue that — via Kindle-type devices — much the same will happen to book authoring. Hmmm…

PicLens

[link] Monday, June 9th, 2008

If you run Firefox and are interested in photography, then PicLens is an astonishing plug-in. Try it.

Adaptive websites

[link] Monday, June 9th, 2008

We’ve become accustomed to websites which affect to ‘personalise’ pages based on the use of cookies. But here’s an interesting piece in Technology Review about technology developed at MIT’s Sloan School of Management which adapts to unknown users within the first few clicks on the website by analyzing each user’s pattern of clicks.

John Hauser, a professor of marketing at the Sloan School and the lead author of a paper on the research that is slated to appear in Marketing Science, explains that a website running the system would detect a user’s cognitive style. It would watch for traits, such as whether or not the user is detail oriented, and morph to complement that style. The changes would be subtle. “Suddenly, you’re finding the website is easy to navigate, more comfortable, and it gives you the information you need,” Hauser says. The user, he says, shouldn’t even realize that the website is personalized.

The researchers built a prototype website for British Telecom, set up to sell broadband plans. The website is designed so that the first few clicks that visitors make are likely to reveal aspects of cognitive style. For example, the initial page that a user sees lets her choose, among other things, to compare plans using a chart or to interact with a broadband advisor. “You can see that someone who’s very analytic is probably more likely to go to ‘compare plans’ than to the direct advisor,” says Hauser. Within about 10 clicks, the system makes a guess at the user’s cognitive style and morphs to fit. “If we determine that you like lots of graphs, you’re going to start seeing lots of graphs,” he says. “If we determine that you like to get advice from peers, you’re going to see lots of advice from peers.”

In addition to guessing at each user’s cognitive style by analyzing that person’s pattern of clicks, the system would track data over time to see which versions of the website work most effectively for which cognitive styles.

On this day…

[link] Wednesday, April 30th, 2008

… in 1993, CERN put the Web into the public domain. See the original document here.

US Department of Justice banned from Wikipedia

[link] Wednesday, April 30th, 2008

Interesting story

Wikipedia has temporarily blocked edits from the US Department of Justice after someone inside the government agency tried to erase references to a particularly-controversial Wiki-scandal.

Early last week, the Boston-based Committee for Accuracy in Middle East Reporting in America (CAMERA) was accused of organizing a secret campaign to influence certain articles on the “free encyclopedia anyone can edit”. Just days later, the DoJ’s IP range was used to edit the site’s entry on the Pro-Israel “media-monitoring group,” lifting a new section that detailed the controversy.

The DoJ did not respond to our requests for comment. But odds are, the edits were made by a single individual acting independently. Wikipedia’s ban on the department’s IP is due to be lifted today…

The earliest days of the Web

[link] Friday, April 4th, 2008

Fascinating video by Robert Scoble in which Ben Segal, who had been network manager at CERN during Tim Berners-Lee’s time there, talks about the invention of the Web. For all CEOs of large companies, and all Directors of large institutions, there’s a very important message roughly six minutes into the tape.