<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>brain of matpalm &#187; statistics</title>
	<atom:link href="http://matpalm.com/blog/tag/statistics/feed/" rel="self" type="application/rss+xml" />
	<link>http://matpalm.com/blog</link>
	<description>thoughts from a data scientist wannabe</description>
	<lastBuildDate>Mon, 16 Aug 2010 11:38:22 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>e12.1 statistical synonyms</title>
		<link>http://matpalm.com/blog/2010/01/23/e12-1-statistical-synonyms/</link>
		<comments>http://matpalm.com/blog/2010/01/23/e12-1-statistical-synonyms/#comments</comments>
		<pubDate>Sat, 23 Jan 2010 02:54:43 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[e12]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=250</guid>
		<description><![CDATA[i&#8217;ve had an idea brewing in my head for awhile now seeded by a great talk by peter norvig about statistically approaches to find patterns in data.
one thing he alludes to is the generation of synoyms based on n-gram models.
the basic intuition is this; if a corpus contains occurrences of the phrases &#8216;a x b&#8217; [...]]]></description>
			<content:encoded><![CDATA[<p>i&#8217;ve had an idea brewing in my head for awhile now seeded by <a href="http://www.youtube.com/watch?v=nU8DcBF-qo4">a great talk by peter norvig</a> about statistically approaches to find patterns in data.</p>
<p>one thing he alludes to is the generation of synoyms based on n-gram models.</p>
<p>the basic intuition is this; if a corpus contains occurrences of the phrases &#8216;a x b&#8217; and &#8216;a y b&#8217; then to some degree x and y are synonymous.</p>
<p>the question becomes how do we calculate the strength of the relationship? how is it a function of the frequencies of a, b, x, y, &#8216;a x b&#8217;, &#8216;a y b&#8217;, &#8216;a ? b&#8217; in the corpus. what else can we take into account?</p>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2010/01/23/e12-1-statistical-synonyms/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>simple statistics with R</title>
		<link>http://matpalm.com/blog/2009/10/03/simple-statistics-with-r/</link>
		<comments>http://matpalm.com/blog/2009/10/03/simple-statistics-with-r/#comments</comments>
		<pubDate>Sat, 03 Oct 2009 05:43:59 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[r]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=77</guid>
		<description><![CDATA[i&#8217;m learning a new statistics language called R and it&#8217;s pretty cool.
make a vector &#8230;
&#62; c(3,1,4,1,5,9,2,6,5,3,5,8)
 [1] 3 1 4 1 5 9 2 6 5 3 5 8
turn it into a frequency table &#8230;
&#62; table(c(3,1,4,1,5,9,2,6,5,3,5,8))
1 2 3 4 5 6 8 9
2 1 2 1 3 1 1 1
sort by frequency &#8230;
&#62; sort(table(c(3,1,4,1,5,9,2,6,5,3,5,8)))
2 4 [...]]]></description>
			<content:encoded><![CDATA[<p>i&#8217;m learning a new statistics language called R and it&#8217;s pretty cool.</p>
<p>make a vector &#8230;</p>
<pre>&gt; c(3,1,4,1,5,9,2,6,5,3,5,8)
 [1] 3 1 4 1 5 9 2 6 5 3 5 8</pre>
<p>turn it into a frequency table &#8230;</p>
<pre>&gt; table(c(3,1,4,1,5,9,2,6,5,3,5,8))
1 2 3 4 5 6 8 9
2 1 2 1 3 1 1 1</pre>
<p>sort by frequency &#8230;</p>
<pre>&gt; sort(table(c(3,1,4,1,5,9,2,6,5,3,5,8)))
2 4 6 8 9 1 3 5
1 1 1 1 1 2 2 3</pre>
<p>and plot!</p>
<pre>&gt; barplot(sort(table(c(3,1,4,1,5,9,2,6,5,3,5,8))))
<img class="alignnone size-full wp-image-78" title="Rplot" src="http://matpalm.com/blog/wp-content/uploads/2009/10/Rplot.png" alt="Rplot" width="480" height="480" />

so simple!</pre>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2009/10/03/simple-statistics-with-r/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>do a degree via youtube</title>
		<link>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/</link>
		<comments>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/#comments</comments>
		<pubDate>Thu, 01 Oct 2009 10:40:39 +0000</pubDate>
		<dc:creator>matpalm</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[lectures]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[stanford]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://matpalm.com/blog/?p=75</guid>
		<description><![CDATA[i&#8217;m amazed by how much great content is on youtube, how could you NOT learn something!?
13 x 1hr Statistical Aspects of Data Mining (Stats 202)
20 x 1hr Machine Learning
]]></description>
			<content:encoded><![CDATA[<p>i&#8217;m amazed by how much great content is on youtube, how could you NOT learn something!?</p>
<p><a href="http://www.youtube.com/view_play_list?p=993FF1801B5AAB4D&amp;search_query=statistical+aspects+of+data+mining+stats+202">13 x 1hr Statistical Aspects of Data Mining (Stats 202)</a></p>
<p><a href="http://www.youtube.com/view_play_list?p=A89DCFA6ADACE599&amp;search_query=machine+learning">20 x 1hr Machine Learning</a></p>
]]></content:encoded>
			<wfw:commentRss>http://matpalm.com/blog/2009/10/01/do-a-degree-via-youtube/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
