Twedar - machine learning side project
Twedar is a little side-project I built about two years back with Philip Cole. It's a live-updated Twitter search, with the power to filter out any noise from unrelated subjects that match your keywords. If you're interested in the Ruby programming language and search Twitter for 'Ruby' you'll get loads of unrelated tweets about rubies in jewellery, people called ruby, ruby in song lyrics, etc. Twedar lets you cut them out.
Behinds the scenes is a really simple version of a technology like the spam filters in Gmail that stops you hearing about all those anatomy enhancement products.
It was fun to build, but we never released it properly! So I'm doing that now - have a play and tell us what you think.
Filtering tweets
Twedar uses a naive bayes classifier. Bayes rule is a lovely little idea that's surprisingly intuitive. Here's a presentation I made about using Bayesian filtering for classifications; there are definitely better resources out there!
Have a look at the code
Read through the code - it was the first real node app I wrote. The front-end is written in Backbone & Rails.