brain of mat kelcey...


first hadoop experiment

September 16, 2009 at 07:26 PM | categories: Uncategorized

just finished my first hadoop experiment.matpalm.com/sipnot fantastic results but heaps of of feedback from hadoop mailing groupmore results coming soon...

how using compressed data can make you app faster

June 28, 2009 at 11:32 AM | categories: Uncategorized

when working with larger data sets (ie more than can fit in memory) there are two important resources to juggle…cpu. how quickly can you process the data.disk io. how quickly can you get data to the cpu.i remember reading once...

erlang profiling

April 22, 2009 at 11:32 AM | categories: Uncategorized

i just found fprof, the erlang profiler by randoming clicking around the erlang man page listtry123fprof:apply(Module, Function, Args).fprof:profile().fprof:analyse().for an interesting breakdown of a call...

bin packing

December 14, 2008 at 11:31 AM | categories: Uncategorized

how to decide what next to backup onto a dvd?when is brute force good enough? will a random walk get a good enough result faster?matpalm.com/burn.it...

the median of a trillion numbers

November 15, 2008 at 11:31 AM | categories: Uncategorized

i got asked in an interview once “how would find the median of a trillion numbers across a thousand machines?”the question has haunted me, until now.here’s my ruby and erlang implementation with a bit of running amazon ec2 thrown in...

fastmap and the jaccard distance

October 31, 2008 at 11:31 AM | categories: Uncategorized

given a set of pairwise distances how do you determine what points correspond to those distances?my latest experiment considers this problem in relation to jaccard distances, a resemblance measure similar to jaccard coefficients used in a previous experimentby using the...

openmp = easy multi threading

October 13, 2008 at 11:30 AM | categories: Uncategorized

openmp is a compiler library, available in gcc since v4.2, for giving hints to a compiler about where code can be parallelized.say we have some code12for(int i=0; i<HUGE_NUMBER; ++i) deadHardCalculation(i)we can make this run on multi threaded by simply...

shingling and the jaccard index

October 06, 2008 at 11:30 AM | categories: Uncategorized

on the weekend i did another experiment using shingling and the jaccard index to try to determine if two sets of data were “duplicates”it works quite well and includes a ruby and c++ version with low level bit operations.project page...

« Previous Page

popular posts...

ensemble nets : training ensembles as a single model using jax on a tpu pod slice(sept 2020)


bnn : counting bees with a rasp pi (may 2018)


drivebot : learning to do laps with reinforcement learning and neural nets (feb 2016)


wikipedia philosophy : do all first links on wikipedia lead to philosophy? (aug 2011)


cartpole++ : deep RL hacking with a complex 3d cart pole environment (aug 2016)


malmomo : deep RL hacking on minecraft with malmo (jan 2017)


some papers from my time at google research / brain...

my honours thesis

the co-evolution of cooperative behaviour (1997) evolving neural nets with genetic algorithms for communication problems.

old projects...