<?xml version="1.0" encoding="UTF-8"?>
 <rss version="2.0" 
    xmlns:content="http://purl.org/rss/1.0/modules/content/"
    xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
	<title>Modeling with Data: the blog</title>
	<link>http://modelingwithdata.org/</link>
	<description>On statistical computing</description>
	<language>en</language>

<item>
        <title>Testing the model using the model</title>
	<content:encoded>
I start by picking at a cartoonist, but this goes well beyond that in importance.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000034.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000034.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>11 January 2010 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Keeping paper current</title>
	<content:encoded>
In which I put an absolutely huge amount of thought into things most folks don't really care about. Then, I exert great effort fighting against the inevitable.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000033.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000033.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>9 January 2010 00:00:00 +0000</pubDate>
</item>

<item>
        <title>The statistics style report</title>
	<content:encoded>
I like to think that I always lean toward parsimony, but I imagine others looking at my work wouldn't think so.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000032.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000032.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>19 December 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Git status interactive</title>
	<content:encoded>
This is a useful script for people using git; if that's not you, don't bother clicking through. I know I've been writing a lot about computing technique lately, so next time: statistics!
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000031.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000031.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>12 December 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>The schism, or why C and C++ are different</title>
	<content:encoded>
Take this as a solicitation for suggestions on how we can get Apophenia's variable-length lists of named arguments (which are entirely C standards-compliant) to work from C++.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000030.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000030.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>21 November 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Yet another Git tutorial</title>
	<content:encoded>
I wrote this for my coworkers, upon whom I've just dropped this whole revision control thing, because the existing tutorials didn't seem at all appropriate for them.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000029.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000029.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>8 November 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Easy re-typing with designated initializers</title>
	<content:encoded>
A straight-up lesson on coding technique. Some of this should have been in the texbook, but I was somehow reluctant to rely too much on designated initializers and compound literals; in retrospect, I'm not sure why.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000028.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000028.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>1 November 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Computing history and its scars</title>
	<content:encoded>
Although it may seem like I'm complaining about whatever glitches, in an alternate universe we'd just have different scars. The path-dependency and potentially infinite life of code means that software is inherently prone to scarring.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000027.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000027.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>19 October 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Causal ethics</title>
	<content:encoded>
Yes, this is a blog about statistics, and yes, this is a post about the development of ethics.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000026.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000026.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>11 October 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Programming your blog</title>
	<content:encoded>
When I'm having trouble with a machine of any sort, I try to think of it as a cultural problem. How were the designers thinking differently from how I'm thinking?
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000025.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000025.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>21 July 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Probability versus likelihood</title>
	<content:encoded>
The question of whether data is plural or a singular can be reasoned through; the question here is well on its way to unanswerable.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000024.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000024.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>2 July 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Data is typically not a plural</title>
	<content:encoded>
This is a lead-in to next episode about another vocabulary problem that's much more difficult. 
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000023.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000023.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>10 June 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Better variadic functions in C</title>
	<content:encoded>
A new edition of the C standard will only come in the distant
future (perhaps never). In the mean time, here's my best solution
to what I see as the language's most glaring problem. I'm open to suggestions on streamlining it further.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000022.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000022.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>3 June 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Alternatives to Word</title>
	<content:encoded>
In two episodes, I'll tell you what I use for this here web site.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000021.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000021.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>30 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Word and standards</title>
	<content:encoded>
I once toyed with writing a book about computing standards, as a follow-up to my exciting book on software patents.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000020.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000020.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>22 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>The ergonomics of the down arrow</title>
	<content:encoded>
This one may seem odd, because it's not a subject many people (including designers) spend much time talking about, but it can still make a big difference, and points to the value of at least having a choice.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000019.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000019.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>19 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Intuition versus ease of use</title>
	<content:encoded>
Ease of initial use is a distinct concept from ease of long-term use. Sometimes they are in happy agreement, and sometimes the first blocks the second.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000018.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000018.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>15 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Views on a paradigm</title>
	<content:encoded>
Part two of six on efficient document prep techniques that are precluded by Word's design.
Don't forget: if you don't have time in front of the computer to read this web page, you can print the PDF.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000017.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000017.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>11 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Why Word is a terrible program</title>
	<content:encoded>
This is an essay from late 2005, in six parts. I'm reprinting it here
because I think it's a good vehicle to get across a number of points about design of information and software.  
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000016.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000016.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>7 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>A general model object</title>
	<content:encoded>
Last time, I talked about the important problem of balkanization in modeling; this episode is about 
doing something about it in code.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000015.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000015.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>4 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Bringing theoretical models to data</title>
	<content:encoded>
It's amazing how very differently people can read a phrase as simple as "Modeling with data".
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000014.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000014.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>1 May 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Operator overloading</title>
	<content:encoded>
Sometimes human language and mathematical language fall into conflict.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000013.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000013.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>29 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Making integers two-dimensional</title>
	<content:encoded>
Or, a means of visualizing integer arithmetic.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000012.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000012.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>22 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Using a program as a library</title>
	<content:encoded>
I'd been neglecting computing in favor of statistics, so here's a useful tip for managing your code.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000011.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000011.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>13 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Your genetic data---ethics</title>
	<content:encoded>
I'll get back to this in a few episodes, but for now, the point here that the imbalance between descriptive and inferential power has ethical as well as purely research implications.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000010.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000010.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>8 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Your genetic data</title>
	<content:encoded>
Given a hundred cases and controls and their genetic markers, I can find you markers to perfectly categorize them in under two minutes. Of course, those markers provide no predictive power outside those hundred subjects.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000009.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000009.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>6 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Freakophenia</title>
	<content:encoded>
The summary of today's entry: if the research is exciting enough to be written up in the newspaper, it's probably not to be trusted.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000008.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000008.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>3 April 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Breaking the pipeline</title>
	<content:encoded>
Much of this one is about the requirements for the OLS 
coefficients to provide the line of best fit (i.e., the Gauss-Markov theorem). If you want to play along, 
try to recall them before you click through. E.g., must the expected value of the 
errors be zero? Are errors assumed to be Normally distributed?
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000007.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000007.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>30 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Crowdsourcing data mining</title>
	<content:encoded>
I think that this gets to the question of what exactly is a hypothesis test. That's a question that we 
don't seriously have an answer to, though I may offer some comments at a later date.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000006.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000006.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>24 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Dataviz</title>
	<content:encoded>
This is cut from a presentation I gave last November (so if you have time to kill, have the right plug-ins, and want to hear my voice, here's a &lt;a href="http://web.mac.com/jprater/iWeb/Site/Podcast/A2A2FC27-659C-4F12-AD02-A4A39521C862.html"&gt;link&lt;/a&gt;).
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000005.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000005.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>18 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Too many tests</title>
	<content:encoded>
Today for lunch, I had rice with wheat gluten, tomatoes, and cashews. The rice is somewhat sweet and very purple, but I can't tell you what kind it is exactly, because the package is entirely in Korean.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000004.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000004.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>16 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Parallels II</title>
	<content:encoded>
This is a continuation of last episode, discussing how technology affects the math we do. Next time, I'll switch from computing to conflicts in stats.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000003.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000003.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>13 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Moore's law won't save you</title>
	<content:encoded>
A lot of us are stuck in an optimistic early-90s mentality toward progress in computing power.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000002.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000002.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>8 March 2009 00:00:00 +0000</pubDate>
</item>

<item>
        <title>Today I am a blog</title>
	<content:encoded>
I never really know what to say in these introductions.
</content:encoded>
        <link>http://modelingwithdata.org/arch/00000001.htm</link>
        <guid isPermaLink="false">http://modelingwithdata.org/arch/00000001.htm</guid>
	<dc:creator>Ben Klemens</dc:creator>
        <pubDate>6 March 2009 00:00:00 +0000</pubDate>
</item>

</channel>
</rss>
