June 25, 2010 at 11:39 PM | categories: twitter, network, infochimps | View Comments

recently signed up to the infochimps api and wanted to do something quick and dirty to get a feel for it.so here's a little experimentget the people i follow on twitterlook up the words that "represent" them according to the infochimps word bag apibuild a similiarity matrix based on the common use of those termsplot the connectivity for the top 30 or so pairingsit's basically partitioned into three groups...veztek (my boss john) and smcinnes (steve from the lonely planet community team) in the top righta big clump of nosqlness with mongodb - hbase - jpatanooga - kevinweil in the bottom...

old projects...

latent semantic analysis via the singular value decomposition (for dummies)
semi supervised naive bayes
statistical synonyms
round the world tweets
decomposing social graphs on twitter
do it yourself statistically improbable phrases
should i burn it?
the median of a trillion numbers
deduping with resemblance metrics
simple supervised learning / should i read it?
audioscrobbler experiments
chaoscope experiment

brain of mat kelcey

friend clustering by term usage