I’ve already run this idea by a couple people, but I thought I’d write about it so there’s some record that it was my damn idea. If you’ve read someone write about this or actually do it already, let me know in the comments.
Tag, You’re It
Many of you are already familiar with tagging, or folksonomies. Popular services use this categorization/organization paradigm to help users manage their content. For example, there’s del.icio.us for bookmarks, Flickr for pictures, 43 Things for goals, and even Technorati for blog posts such as this one. Some of these services are even beginning to list content from these other services when browsing tagged content.
My bright idea is that instead of relying on any of these services showing the relevant data from other sites, you use an independent tag aggregator. That’s where I step in. Exact implementation details aside, I want to provide an interface to browsing tagged data from every site possible. I don’t want to restrict the data to the several already popular sites, but also include results from lesser known services, such as Tagsurf, the tagged message board.
It could provide a common interface, allow for different stylesheets, maybe even have user profiles to retain certain browsing/searching preferences. All these ideas are very free form right now, but I’d like some feedback on what you’d like to see.
What’s the Problem, Bub?
My real hangup about this idea is several fold. They all basically relate to popularity if this does catch on like I expect it would.
First, bandwidth could be a real concern. My site barely uses any now, but at the same time, I don’t have the largest readership (I still haven’t found a good way to measure it, so maybe I’m wrong). Many sites that provide even small sized content (like just text) have issues with this due to popularity. Take a look at DrunkenBlog.
The second issue is that of implementation. Should I use PHP? Maybe Python? What about caching? I don’t know. It also depends on how hard it is to interact with the various APIs that these services provide. There is no universal API for these types of services (which is in itself a shame). XML-RPC, REST, SOAP…. I’ll be knee deep in alphabet soup. I could (and will) take on help should anyone want to help. The problem then becomes how to manage the software. I’ll be honest that I’ve never set up a version control repository. Do I want to make it open source or do I want to try to be greedy and keep the code to myself?
The third issue really troubling me is that of caching. Most of these services are concerned about bandwidth (and processor time) as well, requesting that anyone using their services don’t abuse them. They kindly, and rightly, request that some kind of caching be used. The question becomes how should this be done? I could store a cache file for each tag (up to a certain predetermined number of items), but will that horribly slow the site? Could a MySQL database help with that? I don’t know. Another issue is how much space will it take up. Could I run out of space at my host?
A fourth issue is that of funding. Ideally, I’d add something like Google’s AdWords in a sidebar. However, will all the services I will be tapping with this be cool with that? Will they want some kind of service charge or kickback?
I have lots of questions and no real answers, so please leave any thoughts you have.
A recent Robot Co-Op broached the issue of API access to their service. Should they provide facilities for members to manipulate their data externally, I’d like to make a Mac OS X client for it, a la Cocoalicious or Flickr Export for iPhoto. It’ll be good practice.