<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>brain of matpalm &#187; machine learning</title>
	<atom:link href="http://matpalm.com/blog/tag/machine-learning/feed/" rel="self" type="application/rss+xml" />
	<link>http://matpalm.com/blog</link>
	<description>thoughts from a data scientist wannabe</description>
	<lastBuildDate>Mon, 16 Aug 2010 11:38:22 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>my list of cool machine learning books</title>
		<link>http://matpalm.com/blog/2010/08/06/my-list-of-cool-machine-learning-books/</link>
		<comments>http://matpalm.com/blog/2010/08/06/my-list-of-cool-machine-learning-books/#comments</comments>
		<pubDate>Fri, 06 Aug 2010 08:35:20 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[machine learning]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=746</guid>
		<description><![CDATA[for the last month or so i&#8217;ve had my head down and have been focusing more on theory (ie reading) than on practice (ie coding)
so rather than write no blog post here&#8217;s mats-list-of-cool-machine-learning-books in the order i think you should consider reading them&#8230;

1) &#8220;programming collective intelligence&#8221; by toby segaran




if you know nothing about machine learning [...]]]></description>
			<content:encoded><![CDATA[<p>for the last month or so i&#8217;ve had my head down and have been focusing more on theory (ie reading) than on practice (ie coding)</p>
<p>so rather than write no blog post here&#8217;s mats-list-of-cool-machine-learning-books in the order i think you should consider reading them&#8230;</p>
<p><span id="more-746"></span></p>
<h2>1) &#8220;<a href="http://amzn.to/a8iq8U" target="_blank">programming collective intelligence</a>&#8221; by toby segaran</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-689" title="pci" src="http://matpalm.com/blog/wp-content/uploads/2010/08/pci.jpg" alt="" width="200" height="215" /></td>
<td>if you know nothing about machine learning and haven&#8217;t done maths since high school then this is the book for you.</p>
<p>it&#8217;s a fantastically accesible introduction to the field. includes almost no theory and explains algorithms using actual python implementations.</td>
</tr>
</tbody>
</table>
<h2>2) &#8220;<a href="http://amzn.to/cvFi7t" target="_blank">data mining</a>&#8221; by witten and frank</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-684" src="http://matpalm.com/blog/wp-content/uploads/2010/08/dm1.jpg" alt="" width="200" height="215" /></td>
<td>this book covers quite a bit more than programming c.i. while still being extremely practical (ie very few formula).</p>
<p>about a fifth of the book is dedicated to weka, a machine learning workbench which was written by the authors. apart from the weka section this book has no code. i made <a href="http://vimeo.com/13051595">a little screencast on weka</a> awhile back if you&#8217;re after a summary.</td>
</tr>
</tbody>
</table>
<h2>3) &#8220;<a href="http://amzn.to/b8gp6U" target="_blank">introduction to data mining</a>&#8221; by tan, steinbach and kumar</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-687" src="http://matpalm.com/blog/wp-content/uploads/2010/08/itdm.jpg" alt="" width="200" height="215" /></td>
<td>covers almost the same material as the witten/frank text but delves a little bit deeper and with more rigour. includes no code (none of the books do from now on) with algorithms described by formula.</p>
<p>has a number of appendices on linear algebra, probability, statistics etc so that you can read up if you&#8217;re a bit rusty or new to the fields (the witten/frank text lack these).</p>
<p>some people might argue having both of these books is a waste since they cover so much of the same ground but i&#8217;ve always found multiple explanations from different authors to be a great way to help understand a topic. i read the witten/frank text first and am glad i did but if i could only keep one i&#8217;d keep this one.</td>
</tr>
</tbody>
</table>
<h2>intermission</h2>
<p>at this point you&#8217;ve probably got enough mental firepower to handle some of the uni level machine learning course notes that are floating about online.</p>
<p>if you&#8217;re keen to get a better foundation of the maths side of things it&#8217;d be worth working through <a href="http://www.youtube.com/watch?v=UzxYlbK2c7E">andrew ng&#8217;s lecture series on machine learning.</a> (20 hours of a second year stanford course on machine learning)</p>
<p>i also found <a href="http://www.cs.cmu.edu/~awm/">andrew moore&#8217;s lecture slides</a> really great. (they do though require a reasonable understanding of the basics)</p>
<h2>4) &#8220;<a href="http://amzn.to/atpHZ2" target="_blank">foundations of statistical natural language processing</a>&#8221; by manning and schutze</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" title="fosnlp" src="http://matpalm.com/blog/wp-content/uploads/2010/08/fosnlp.jpg" alt="" width="200" height="215" /></td>
<td>not a machine learning book as such but great for learning to deal with one of the most common types of data around; text. since most of machine learning theory is about maths (ie numbers) this is awesome in helping to understanding how to deal with text in a mathematical context.</td>
</tr>
</tbody>
</table>
<h2>5) &#8220;<a href="http://amzn.to/99UJfV" target="_blank">introduction to machine learning</a>&#8221; by ethem alpaydin</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" src="http://matpalm.com/blog/wp-content/uploads/2010/08/itml.jpg" alt="" width="200" height="215" /></td>
<td>covers generally the same sort of topics as the data mining books but with much more rigour and theory (derivations, proofs, etc). i think this is a good thing though since understanding how things work at a low level gives you the ability to tweak and modify as required.</p>
<p>loads more formulas but again with appendixs that introduce the basics in enough detail to get by.</td>
</tr>
</tbody>
</table>
<h2>6) &#8220;<a href="http://amzn.to/ap9Kgf" target="_blank">all of statistics</a>&#8221; by larry wasserman</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" src="http://matpalm.com/blog/wp-content/uploads/2010/08/aos.jpg" alt="" /></td>
<td>by this stage you&#8217;ll probably have an appreciation of how important statistics is for this domain and it might be worth foccussing on it for a bit.</p>
<p>personally i found this book to be a great read and though i&#8217;ve only read certain sections in depth i&#8217;m looking forward to when i get a chance to work through it cover to cover</td>
</tr>
</tbody>
</table>
<h2>7) &#8220;the elements of statistical learning&#8221; by hastie, tibshirani and friedman.</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" src="http://matpalm.com/blog/wp-content/uploads/2010/08/eosl.jpg" alt="" /></td>
<td>with a bit more stats under your belt you might have a chance of getting through this one; the most complex of the lot.</p>
<p>this book is absolutely beautifully presented and now that it&#8217;s <a href="http://www-stat.stanford.edu/~tibs/ElemStatLearn/">FREE to download</a> you&#8217;ve got no reason not to have a crack at it.</p>
<p>a remarkable piece of work and one i&#8217;ve yet to get through fully cover to cover, it&#8217;s quite hardcore and right on the border of my level of understanding ( which makes it perfect for me :P )</td>
</tr>
</tbody>
</table>
<h2>ps. books i haven&#8217;t read that are in the mail</h2>
<h2>&#8220;<a href="http://amzn.to/dkcGxb" target="_blank">machine learning</a>&#8221; by tom mitchell</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" src="http://matpalm.com/blog/wp-content/uploads/2010/08/ml.jpg" alt="" /></td>
<td>have been wanting to read this one for awhile, i&#8217;m a big fan of <a href="http://www.cs.cmu.edu/~tom/">tom mitchell</a>, but couldn&#8217;t justify the cost</p>
<p>however just found out the other day the paperback is a third of the price of the hardback i was looking at!! the book&#8217;s in the mail</td>
</tr>
</tbody>
</table>
<h2><a href="http://amzn.to/9IzWtN" target="_blank">&#8220;pattern recognition and machine learning</a>&#8221; by chris bishop</h2>
<table>
<tbody style="vertical-align: top;">
<tr>
<td><img class="alignnone size-full wp-image-686" src="http://matpalm.com/blog/wp-content/uploads/2010/08/prml.jpg" alt="" /></td>
<td>all of a sudden seemed like everyone was reading this but me so it was time to jump on the bandwagon</td>
</tr>
</tbody>
</table>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2010/08/06/my-list-of-cool-machine-learning-books/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>brutally short intro to weka</title>
		<link>http://matpalm.com/blog/2010/07/03/brutally-short-intro-to-weka/</link>
		<comments>http://matpalm.com/blog/2010/07/03/brutally-short-intro-to-weka/#comments</comments>
		<pubDate>Sat, 03 Jul 2010 07:35:27 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[brutally short intro]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[weka]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=677</guid>
		<description><![CDATA[weka is a java based machine learning workbench that i&#8217;ve found useful to playing with to help understand some standard machine learning algorithms. in this quick demo i show how to build a classifier for three simple datasets; two of which address the basics of text classification

brutally short intro to weka from Mat Kelcey on [...]]]></description>
			<content:encoded><![CDATA[<p>weka is a java based machine learning workbench that i&#8217;ve found useful to playing with to help understand some standard machine learning algorithms. in this quick demo i show how to build a classifier for three simple datasets; two of which address the basics of text classification</p>
<p><object width="400" height="300"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=13051595&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" /><embed src="http://vimeo.com/moogaloop.swf?clip_id=13051595&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="400" height="300"></embed></object>
<p><a href="http://vimeo.com/13051595">brutally short intro to weka</a> from <a href="http://vimeo.com/user2935988">Mat Kelcey</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2010/07/03/brutally-short-intro-to-weka/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>an intro to semi supervised document classification</title>
		<link>http://matpalm.com/blog/2010/01/31/an-intro-to-semi-supervised-document-classification/</link>
		<comments>http://matpalm.com/blog/2010/01/31/an-intro-to-semi-supervised-document-classification/#comments</comments>
		<pubDate>Sun, 31 Jan 2010 04:02:33 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[naive bayes]]></category>
		<category><![CDATA[semi supervised]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=275</guid>
		<description><![CDATA[here&#8217;s a great lecture from tom mitchell about document classification using a semi supervised version of naive bayes.
semi supervised algorithms only require some of the training examples to be labeled and are able to make use of any unlabelled ones, very common when we have a huge corpus.
i&#8217;ve started an experiment brewing to test this [...]]]></description>
			<content:encoded><![CDATA[<p>here&#8217;s a great <a href="http://videolectures.net/mlas06_mitchell_sla/">lecture</a> from <a href="http://www.cs.cmu.edu/~tom/">tom mitchell</a> about document classification using a semi supervised version of naive bayes.</p>
<p>semi supervised algorithms only require some of the training examples to be labeled and are able to make use of any unlabelled ones, very common when we have a huge corpus.</p>
<p>i&#8217;ve started an experiment brewing to test this out by porting some <a href="http://matpalm.com/rss.feed/p3/">previous naive bayes work</a> i did to use this semi supervised scheme and will published it when it&#8217;s done.</p>
<p>cool stuff!!</p>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2010/01/31/an-intro-to-semi-supervised-document-classification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>do a degree via youtube</title>
		<link>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/</link>
		<comments>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 10:40:39 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[lectures]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[stanford]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=75</guid>
		<description><![CDATA[i&#8217;m amazed by how much great content is on youtube, how could you NOT learn something!?
13 x 1hr Statistical Aspects of Data Mining (Stats 202)
20 x 1hr Machine Learning
]]></description>
			<content:encoded><![CDATA[<p>i&#8217;m amazed by how much great content is on youtube, how could you NOT learn something!?</p>
<p><a href="http://www.youtube.com/view_play_list?p=993FF1801B5AAB4D&amp;search_query=statistical+aspects+of+data+mining+stats+202">13 x 1hr Statistical Aspects of Data Mining (Stats 202)</a></p>
<p><a href="http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599&amp;search_query=machine+learning">20 x 1hr Machine Learning</a></p>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
