<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Statistical Graphics and Data Visualization</title>
	<atom:link href="http://statgraphics.blog.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://statgraphics.blog.com</link>
	<description>Statistical Graphics, Data Visualization, Data Analysis, User Interfacesand related</description>
	<pubDate>Tue, 14 Nov 2006 21:10:01 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Good bye statgraphics™</title>
		<link>http://statgraphics.blog.com/2006/11/14/good-bye-statgraphics%e2%84%a2/</link>
		<comments>http://statgraphics.blog.com/2006/11/14/good-bye-statgraphics%e2%84%a2/#comments</comments>
		<pubDate>Tue, 14 Nov 2006 21:10:01 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[Our "friends" at <a href="http://www.statpoint.com/">Statgraphics</a>™ (￼￼￼Statgraphics® is a registered trademark of StatPoint, Inc) sent me unfriendly letters and threaten me "appropriate legal actions" in case I use this URL any longer, and force me to move further posts to a new site.
<p>New posts will be at</p>
<center><a href="http://statisticalgraphics.blog.com">http://statisticalgraphics.blog.com</a></center>
<p>I will give further details on their strange behavior later ...<br />
(Some software companies seem to invest more money into lawyers than into good software developers)</p>

]]></description>
			<content:encoded><![CDATA[<div>Our &#8220;friends&#8221; at <a href="http://www.statpoint.com/">Statgraphics</a>™ (￼￼￼Statgraphics® is a registered trademark of StatPoint, Inc) sent me unfriendly letters and threaten me &#8220;appropriate legal actions&#8221; in case I use this URL any longer, and force me to move further posts to a new site.</p>
<p>New posts will be at</p>
<p><center><a href="http://statisticalgraphics.blog.com">http://statisticalgraphics.blog.com</a></center></p>
<p>I will give further details on their strange behavior later &#8230;<br />
(Some software companies seem to invest more money into lawyers than into good software developers)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/11/14/good-bye-statgraphics%e2%84%a2/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Communities of Interest &#8230;</title>
		<link>http://statgraphics.blog.com/2006/10/17/communities-of-interest/</link>
		<comments>http://statgraphics.blog.com/2006/10/17/communities-of-interest/#comments</comments>
		<pubDate>Tue, 17 Oct 2006 21:48:13 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[... is the name for subgraphs of a large network (e.g. telephone calls) which have certain target properties.
<p>Chris Volinsky has a very nice page, which allows to look for subgraphs that connect authors by papers in computer science journals (based on DBLP) or actors connected by movies (based on IMDB).</p>
<p>Here is the proximity graph that connects me (not being a computer scientist) with Donald E. Knuth: <a href="http://www.rosuda.org/~theus/Blog/Graph.png"><img width="440" src="http://www.rosuda.org/~theus/Blog/Graph.png" /></a></p>
<p>Visit Chris' page <a href="http://public.research.att.com/~volinsky/cgi-bin/prox/prox.pl">here</a>.</p>

]]></description>
			<content:encoded><![CDATA[<div>&#8230; is the name for subgraphs of a large network (e.g. telephone calls) which have certain target properties.</p>
<p>Chris Volinsky has a very nice page, which allows to look for subgraphs that connect authors by papers in computer science journals (based on DBLP) or actors connected by movies (based on IMDB).</p>
<p>Here is the proximity graph that connects me (not being a computer scientist) with Donald E. Knuth: <a href="http://www.rosuda.org/~theus/Blog/Graph.png"><img width="440" src="http://www.rosuda.org/~theus/Blog/Graph.png" /></a></p>
<p>Visit Chris&#8217; page <a href="http://public.research.att.com/~volinsky/cgi-bin/prox/prox.pl">here</a>.</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/10/17/communities-of-interest/feed/</wfw:commentRss>
		</item>
		<item>
		<title>statistical graphics 101: Histograms</title>
		<link>http://statgraphics.blog.com/2006/10/17/statistical-graphics-101-histograms/</link>
		<comments>http://statgraphics.blog.com/2006/10/17/statistical-graphics-101-histograms/#comments</comments>
		<pubDate>Tue, 17 Oct 2006 19:59:55 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[It's been too long since the last posting (on <a href="http://statgraphics.blog.com/178439/">barcharts</a>) in the teaching corner. This one will be on histograms.
<p>Histograms are often mistaken with barcharts. The fundamental distinction between the two is</p>
<ul>
<li>Barcharts show counts (or weights) for the discrete axis of a <b>categorical</b> variable</li>
<li>Histograms show an approximation of the density function (if scaled accordingly) of a <b>countinuous</b> variable.</li>
</ul>
As a consequence, the only thing that can be quantified in a barchart is the bar <b>height</b> (better length, which makes it independent from their orientation). On the other hand, in a histogram, the <b>area</b> of the boxes is proportional to the density approximation. If all bars have the same width in a barcharts, or gaps are drawn in a histogram (which is complete nonsense), the two plots can get mixed up.
<center><img src="http://www.rosuda.org/~theus/Blog/histo.png" /><br />
(% votes for Kerry in the 2004 presidential election)</center>
<p>Much has been written about optimal bin width for histograms - almost nothing about the choice of the anchor point. Changing the latter often changes the shape of the histogram more dramatic than choosing between 8, 9 or 10 bins.<br />
Setting the anchor point from 0 to -2.4 yields:</p>
<center><img src="http://www.rosuda.org/~theus/Blog/anchor.png" /><br />
(changing the anchor point to -2.4)</center>
<p>In most applications, there are <b>sensible</b> breaks which can be chosen. Since we look at 3,111 voting districts, we can use far more bins and start at 0 with bin width of 5.</p>
<center><img src="http://www.rosuda.org/~theus/Blog/density.png" /><br />
(using meaningful parameters (0, 5) - density estimate added)</center>
<p>If we link a second attribute into the histogram, the whole thing gets more exiting!</p>
<center><img src="http://www.rosuda.org/~theus/Blog/hilite.png" /><br />
(all districts where more than 15% have a college degree selected)</center>
<p>We don't really can see what is going on here (although we might guess, that there is a slightly higher proportion of highlighting towards the higher votes for Kerry).</p>
<p>When we use the same normalization trick as for Spineplots (see <a href="http://statgraphics.blog.com/178439/">previous post</a>), we get the clearer picture of the <b>Spinogram</b>:</p>
<center><img src="http://www.rosuda.org/~theus/Blog/spinogram.png" /><br />
(highlighting in a Spinogram)</center>
<p>Well, that's what we would have expected, probably except for the increase at the left end of the scale.</p>
<p>The problem with the histograms used so far is that we looked at voting districts, and not at voters! This will distort the impression if the districts are not of equal size. Weighting above histogram will move the mode further to the right.</p>
<center><img src="http://www.rosuda.org/~theus/Blog/weighted.png" /><br />
(the weighted histogram represents voters not districts)</center>
<p>Finally we get the weighted spinogram, which probably supports more the hypothesis ... of the selected group.</p>
<center><img src="http://www.rosuda.org/~theus/Blog/spinoweight.png" /><br />
(the weighted spinogram)</center>
<p>(Sorry for the lengthy post ... but concepts get a bit more complex)</p>

]]></description>
			<content:encoded><![CDATA[<div>It&#8217;s been too long since the last posting (on <a href="http://statgraphics.blog.com/178439/">barcharts</a>) in the teaching corner. This one will be on histograms.</p>
<p>Histograms are often mistaken with barcharts. The fundamental distinction between the two is</p>
<ul>
<li>Barcharts show counts (or weights) for the discrete axis of a <b>categorical</b> variable</li>
<li>Histograms show an approximation of the density function (if scaled accordingly) of a <b>countinuous</b> variable.</li>
</ul>
<p>As a consequence, the only thing that can be quantified in a barchart is the bar <b>height</b> (better length, which makes it independent from their orientation). On the other hand, in a histogram, the <b>area</b> of the boxes is proportional to the density approximation. If all bars have the same width in a barcharts, or gaps are drawn in a histogram (which is complete nonsense), the two plots can get mixed up.<br />
<center><img src="http://www.rosuda.org/~theus/Blog/histo.png" /><br />
(% votes for Kerry in the 2004 presidential election)</center></p>
<p>Much has been written about optimal bin width for histograms - almost nothing about the choice of the anchor point. Changing the latter often changes the shape of the histogram more dramatic than choosing between 8, 9 or 10 bins.<br />
Setting the anchor point from 0 to -2.4 yields:</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/anchor.png" /><br />
(changing the anchor point to -2.4)</center></p>
<p>In most applications, there are <b>sensible</b> breaks which can be chosen. Since we look at 3,111 voting districts, we can use far more bins and start at 0 with bin width of 5.</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/density.png" /><br />
(using meaningful parameters (0, 5) - density estimate added)</center></p>
<p>If we link a second attribute into the histogram, the whole thing gets more exiting!</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/hilite.png" /><br />
(all districts where more than 15% have a college degree selected)</center></p>
<p>We don&#8217;t really can see what is going on here (although we might guess, that there is a slightly higher proportion of highlighting towards the higher votes for Kerry).</p>
<p>When we use the same normalization trick as for Spineplots (see <a href="http://statgraphics.blog.com/178439/">previous post</a>), we get the clearer picture of the <b>Spinogram</b>:</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/spinogram.png" /><br />
(highlighting in a Spinogram)</center></p>
<p>Well, that&#8217;s what we would have expected, probably except for the increase at the left end of the scale.</p>
<p>The problem with the histograms used so far is that we looked at voting districts, and not at voters! This will distort the impression if the districts are not of equal size. Weighting above histogram will move the mode further to the right.</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/weighted.png" /><br />
(the weighted histogram represents voters not districts)</center></p>
<p>Finally we get the weighted spinogram, which probably supports more the hypothesis &#8230; of the selected group.</p>
<p><center><img src="http://www.rosuda.org/~theus/Blog/spinoweight.png" /><br />
(the weighted spinogram)</center></p>
<p>(Sorry for the lengthy post &#8230; but concepts get a bit more complex)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/10/17/statistical-graphics-101-histograms/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The Good &#38; the Bad [10/2006]</title>
		<link>http://statgraphics.blog.com/2006/10/14/the-good-the-bad-102006/</link>
		<comments>http://statgraphics.blog.com/2006/10/14/the-good-the-bad-102006/#comments</comments>
		<pubDate>Sat, 14 Oct 2006 11:16:56 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[Antony pointed me to this nice example found on BBC News. <img src="http://newsimg.bbc.co.uk/media/images/41604000/gif/_41604246_prisons_2graph416x241.gif" /><br />
<br />
So what is the message here? "Chinese and other foreigners (not being British nationals) more and more fill Brirish jails ..."
<p>Well, as we look at the percentage changes, we do not have any clue about the underlying group sizes. As whites are by far the larger group in this example, the absolute increase for whites is certainly much bigger. Any better display at hand?</p>
<p>So called "Skyline Plots" - as implemented in <a href="http://www.rosuda.org/Renoir/">RENOIR</a> - take the absolute size of groups into account by adjusting the bin width, such that the plot covers both aspects: absolute and relative change.</p>
<p><a href="http://www.rosuda.org/Renoir/Images/RenoirDAX.JPG"><img width="252" src="http://www.rosuda.org/Renoir/Images/RenoirDAX.JPG" /></a><br />
(This is certainly a different example as the one from BBC News, but without the absolute figures it is impossible to recreate the skyline plot for the prison example.)</p>
<p>Looking at the colors, we find the odd choice of coding an increase of prisoners in green and the decrease in red. (Does not make much sense, unless the graph comes from the company which runs the prinson ...)</p>

]]></description>
			<content:encoded><![CDATA[<div>Antony pointed me to this nice example found on BBC News. <img src="http://newsimg.bbc.co.uk/media/images/41604000/gif/_41604246_prisons_2graph416x241.gif" /></p>
<p>So what is the message here? &#8220;Chinese and other foreigners (not being British nationals) more and more fill Brirish jails &#8230;&#8221;</p>
<p>Well, as we look at the percentage changes, we do not have any clue about the underlying group sizes. As whites are by far the larger group in this example, the absolute increase for whites is certainly much bigger. Any better display at hand?</p>
<p>So called &#8220;Skyline Plots&#8221; - as implemented in <a href="http://www.rosuda.org/Renoir/">RENOIR</a> - take the absolute size of groups into account by adjusting the bin width, such that the plot covers both aspects: absolute and relative change.</p>
<p><a href="http://www.rosuda.org/Renoir/Images/RenoirDAX.JPG"><img width="252" src="http://www.rosuda.org/Renoir/Images/RenoirDAX.JPG" /></a><br />
(This is certainly a different example as the one from BBC News, but without the absolute figures it is impossible to recreate the skyline plot for the prison example.)</p>
<p>Looking at the colors, we find the odd choice of coding an increase of prisoners in green and the decrease in red. (Does not make much sense, unless the graph comes from the company which runs the prinson &#8230;)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/10/14/the-good-the-bad-102006/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Le Tour 2006</title>
		<link>http://statgraphics.blog.com/2006/06/30/le-tour-2006/</link>
		<comments>http://statgraphics.blog.com/2006/06/30/le-tour-2006/#comments</comments>
		<pubDate>Fri, 30 Jun 2006 18:00:52 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[That's it for this year. Au revoir 2007!.<br />
<table>
<tr>
<th>Stage Results</th>
<th>cumulative Time</th>
<th>Ranks</th>
</tr>
<tr>
<td><a href="http://www.rosuda.org/~theus/Blog/Stage.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Stage.png" alt="Stage" /></a></td>
<td><a href="http://www.rosuda.org/~theus/Blog/Total.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Total.png" alt="Total" /></a></td>
<td><a href="http://www.rosuda.org/~theus/Blog/Rank.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Rank.png" alt="Rank" /></a></td>
</tr>
<tr>
<td colspan="3">(click on the images to enlarge)</td>
</tr>
</table>
<p>Prolog: Did anyone have Thor HUSHOVD on his list?<br />
Stage 01: Except for 5, all arrived in the peleton<br />
Stage 02: MC EWEN already on 3.<br />
Stage 03: First drop outs<br />
Stage 04: BOONEN still keeps the yellow jersey<br />
Stage 05: Only O'GRADY can't keep the pace of the top 10 from the Prolog<br />
Stage 06: First 37 still in a window of one minute<br />
Stage 07: This was the day of team T-Mobile!<br />
Stage 08: No stage to remember ...<br />
Stage 09: Still waiting for the mountains, so we look at Sebastian JOLY ...<br />
Stage 10: MERCADO and DESSEL out of the blue?<br />
Stage 11: LANDIS now in yellow<br />
Stage 12: POPOVYCH's day, now on 10. 5 withdrawls after the mountains.<br />
Stage 13: PEREIRO SIO's second place awards him the yellow jersey.<br />
Stage 14: No changes within the top 17.<br />
Stage 15: LANDIS back in yellow; two more mountain stages; down to 152<br />
Stage 16: LANDIS passes yellow to PEREIRO, KLÖDEN the real winner in the end?<br />
Stage 17: LANDIS back after great ride; top 3 within 30''<br />
Stage 18: No changes, we are all waiting for the show down tomorrow<br />
Stage 19: Everything as expected, LANDIS too strong for PEREIRO<br />
Stage 20: Profile of a winner ...<br /></p>
<p>To play with the data yourself, get the <a href="http://www.rosuda.org/Mondrian/">Mondrian</a> software and the <a href="http://www.rosuda.org/~theus/Blog/TDF2006.txt">dataset</a>. Thanks goes to Sergej Potapov, who wrote the script to manage the data!</p>

]]></description>
			<content:encoded><![CDATA[<div>That&#8217;s it for this year. Au revoir 2007!.</p>
<table>
<tr>
<th>Stage Results</th>
<th>cumulative Time</th>
<th>Ranks</th>
</tr>
<tr>
<td><a href="http://www.rosuda.org/~theus/Blog/Stage.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Stage.png" alt="Stage" /></a></td>
<td><a href="http://www.rosuda.org/~theus/Blog/Total.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Total.png" alt="Total" /></a></td>
<td><a href="http://www.rosuda.org/~theus/Blog/Rank.png"><img border="0" width="150" src="http://www.rosuda.org/~theus/Blog/Rank.png" alt="Rank" /></a></td>
</tr>
<tr>
<td colspan="3">(click on the images to enlarge)</td>
</tr>
</table>
<p>Prolog: Did anyone have Thor HUSHOVD on his list?<br />
Stage 01: Except for 5, all arrived in the peleton<br />
Stage 02: MC EWEN already on 3.<br />
Stage 03: First drop outs<br />
Stage 04: BOONEN still keeps the yellow jersey<br />
Stage 05: Only O&#8217;GRADY can&#8217;t keep the pace of the top 10 from the Prolog<br />
Stage 06: First 37 still in a window of one minute<br />
Stage 07: This was the day of team T-Mobile!<br />
Stage 08: No stage to remember &#8230;<br />
Stage 09: Still waiting for the mountains, so we look at Sebastian JOLY &#8230;<br />
Stage 10: MERCADO and DESSEL out of the blue?<br />
Stage 11: LANDIS now in yellow<br />
Stage 12: POPOVYCH&#8217;s day, now on 10. 5 withdrawls after the mountains.<br />
Stage 13: PEREIRO SIO&#8217;s second place awards him the yellow jersey.<br />
Stage 14: No changes within the top 17.<br />
Stage 15: LANDIS back in yellow; two more mountain stages; down to 152<br />
Stage 16: LANDIS passes yellow to PEREIRO, KLÖDEN the real winner in the end?<br />
Stage 17: LANDIS back after great ride; top 3 within 30&#8221;<br />
Stage 18: No changes, we are all waiting for the show down tomorrow<br />
Stage 19: Everything as expected, LANDIS too strong for PEREIRO<br />
Stage 20: Profile of a winner &#8230;</p>
<p>To play with the data yourself, get the <a href="http://www.rosuda.org/Mondrian/">Mondrian</a> software and the <a href="http://www.rosuda.org/~theus/Blog/TDF2006.txt">dataset</a>. Thanks goes to Sergej Potapov, who wrote the script to manage the data!</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/06/30/le-tour-2006/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Chart Junk?!</title>
		<link>http://statgraphics.blog.com/2006/06/26/chart-junk/</link>
		<comments>http://statgraphics.blog.com/2006/06/26/chart-junk/#comments</comments>
		<pubDate>Mon, 26 Jun 2006 10:49:44 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[Here is an example of the so called "Sectioned Density Plot", which was recently published in "The American Statistician" (<a href="http://www.ingentaconnect.com/content/asa/tas/2006/00000060/00000002/art00010;jsessionid=1f4g6jeslicbb.alice">Vol 60, No. 2, 167-174</a>).
<p><img src="http://am.air.org/help/NAEPTextbook/images/SectionedDensityPlot.jpg" /></p>
<p>Using a simple histogram, maybe with an added density estimator, and/or a simple standard boxplot for group comparison does the job here. No need to "invent" a new plot, which introduces more problems as it solves any.<br />
Actually this plot makes a good case against the use of graphics ...</p>

]]></description>
			<content:encoded><![CDATA[<div>Here is an example of the so called &#8220;Sectioned Density Plot&#8221;, which was recently published in &#8220;The American Statistician&#8221; (<a href="http://www.ingentaconnect.com/content/asa/tas/2006/00000060/00000002/art00010;jsessionid=1f4g6jeslicbb.alice">Vol 60, No. 2, 167-174</a>).</p>
<p><img src="http://am.air.org/help/NAEPTextbook/images/SectionedDensityPlot.jpg" /></p>
<p>Using a simple histogram, maybe with an added density estimator, and/or a simple standard boxplot for group comparison does the job here. No need to &#8220;invent&#8221; a new plot, which introduces more problems as it solves any.<br />
Actually this plot makes a good case against the use of graphics &#8230;</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/06/26/chart-junk/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The Good &#38; the Bad [6/2006]</title>
		<link>http://statgraphics.blog.com/2006/06/12/the-good-the-bad-62006/</link>
		<comments>http://statgraphics.blog.com/2006/06/12/the-good-the-bad-62006/#comments</comments>
		<pubDate>Mon, 12 Jun 2006 19:28:28 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[Haven't we been preaching against 3-d barcharts and the like for a long time? Here is what we didn't think of in our wildest dreams: an animated 3-d barchart!
<center><object classid="clsid:02BF25D5-8C17-4B23-BC80-D3488ABDDC6B" codebase="http://www.apple.com/qtactivex/qtplugin.cab" height="255" width="392"><param name="cache" value="true" />
<param name="src" value="http://rosuda.org/~theus/Blog/barchart.mov" />
<param name="autoplay" value="true" />
<param name="controller" value="true" />
<param name="loop" value="true" />
<embed height="255" pluginspage="http://www.apple.com/quicktime/download/" src="http://rosuda.org/~theus/Blog/barchart.mov" type="video/mpeg" width="392" controller="true" autoplay="true" cache="true" loop="true" /></object></center>
<br />
As you might guess, this is the usage statistics from this blog over the last week. It is not hard to draw "The Good" (far less sensational)
<p><img src="http://rosuda.org/~theus/Blog/Barchart.png" /></p>
<p>(This statistics also tells me why my quota is used up, so I moved most of the recent stuff to another server!)</p>

]]></description>
			<content:encoded><![CDATA[<div>Haven&#8217;t we been preaching against 3-d barcharts and the like for a long time? Here is what we didn&#8217;t think of in our wildest dreams: an animated 3-d barchart!<br />
<center><object classid="clsid:02BF25D5-8C17-4B23-BC80-D3488ABDDC6B" codebase="http://www.apple.com/qtactivex/qtplugin.cab" height="255" width="392"><param name="cache" value="true" /><param name="src" value="http://rosuda.org/~theus/Blog/barchart.mov" /><param name="autoplay" value="true" /><param name="controller" value="true" /><param name="loop" value="true" /><embed height="255" pluginspage="http://www.apple.com/quicktime/download/" src="http://rosuda.org/~theus/Blog/barchart.mov" type="video/mpeg" width="392" controller="true" autoplay="true" cache="true" loop="true" /></object></center><br />
<br />
As you might guess, this is the usage statistics from this blog over the last week. It is not hard to draw &#8220;The Good&#8221; (far less sensational)</p>
<p><img src="http://rosuda.org/~theus/Blog/Barchart.png" /></p>
<p>(This statistics also tells me why my quota is used up, so I moved most of the recent stuff to another server!)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/06/12/the-good-the-bad-62006/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Tour de France 2005 Update</title>
		<link>http://statgraphics.blog.com/2006/06/03/tour-de-france-2005-update/</link>
		<comments>http://statgraphics.blog.com/2006/06/03/tour-de-france-2005-update/#comments</comments>
		<pubDate>Sat, 03 Jun 2006 09:42:50 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[Well, the Tour de France 2006 is only a few weeks ahead, probably still time to give an update on the 2005 data.<br />
We now have the data recompiled with drop outs (thanks to Sergej).<br />
<a href="http://rosuda.org/~theus/Blog/TDFMissing.png"><img width="450" src="http://rosuda.org/~theus/Blog/TDFMissing.png" /></a>
<p>Interesting to see that almost half of the cyclists categorized as "sprinter" don't make it to the Champs-Elysees.</p>
<p>Looking at the ranks is quite funny as well. Here's what happens when you start as first and last ... <a href="http://rosuda.org/~theus/Blog/TDFMissingRank.png"><img width="450" src="http://rosuda.org/~theus/Blog/TDFMissingRank.png" /></a><br />
(Certainly, David Zabriskie would have probably looked better when not having the crash in the team time trial)</p>

]]></description>
			<content:encoded><![CDATA[<div>Well, the Tour de France 2006 is only a few weeks ahead, probably still time to give an update on the 2005 data.<br />
We now have the data recompiled with drop outs (thanks to Sergej).<br />
<a href="http://rosuda.org/~theus/Blog/TDFMissing.png"><img width="450" src="http://rosuda.org/~theus/Blog/TDFMissing.png" /></a></p>
<p>Interesting to see that almost half of the cyclists categorized as &#8220;sprinter&#8221; don&#8217;t make it to the Champs-Elysees.</p>
<p>Looking at the ranks is quite funny as well. Here&#8217;s what happens when you start as first and last &#8230; <a href="http://rosuda.org/~theus/Blog/TDFMissingRank.png"><img width="450" src="http://rosuda.org/~theus/Blog/TDFMissingRank.png" /></a><br />
(Certainly, David Zabriskie would have probably looked better when not having the crash in the team time trial)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/06/03/tour-de-france-2005-update/feed/</wfw:commentRss>
		</item>
		<item>
		<title>MS Chart Junk</title>
		<link>http://statgraphics.blog.com/2006/06/03/ms-chart-junk/</link>
		<comments>http://statgraphics.blog.com/2006/06/03/ms-chart-junk/#comments</comments>
		<pubDate>Sat, 03 Jun 2006 09:29:50 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[For those of you who do not stop by at <a href="http://junkcharts.typepad.com/junk_charts/">JunkCharts</a> regularly, here is what John S. found:<br />
<a href="http://junkcharts.typepad.com/junk_charts/"><img width="440" src="http://common.ziffdavisinternet.com/util_get_image/12/0,1425,sz=1&#38;i=128393,00.jpg" /></a><br />
Probably one of the best example of "featurism"; computer scientists set free with no idea of application ...
]]></description>
			<content:encoded><![CDATA[<div>For those of you who do not stop by at <a href="http://junkcharts.typepad.com/junk_charts/">JunkCharts</a> regularly, here is what John S. found:<br />
<a href="http://junkcharts.typepad.com/junk_charts/"><img width="440" src="http://common.ziffdavisinternet.com/util_get_image/12/0,1425,sz=1&amp;i=128393,00.jpg" /></a><br />
Probably one of the best example of &#8220;featurism&#8221;; computer scientists set free with no idea of application &#8230;
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/06/03/ms-chart-junk/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Minard&#8217;s birthday challenge</title>
		<link>http://statgraphics.blog.com/2006/03/29/minards-birthday-challenge/</link>
		<comments>http://statgraphics.blog.com/2006/03/29/minards-birthday-challenge/#comments</comments>
		<pubDate>Wed, 29 Mar 2006 21:51:26 +0000</pubDate>
		<dc:creator>Martin</dc:creator>
		
		<guid isPermaLink="false"></guid>
		<description><![CDATA[We got this from <a href="mailto:Pukelsheim@Math.Uni-Augsburg.De">Friedrich</a> who is just finishing a sabbatical in Paris:<br />
<a href="http://rosuda.org/~theus/Blog/leMonde.jpg"><img src="http://rosuda.org/~theus/Blog/leMondeSmall.jpg" /></a><br />
<a href="mailto:friendly@yorku.ca">Michael</a> came up with the idea of a challenge. Here's what he wrote:
<p><i>Here's a graph you may find interesting, published yesterday in <a href="http://www.lemonde.fr/web/infog/0,47-0@2-3224,54-755523@51-725561,0.html">Le Monde</a>.<br />
The manifest goal is to compare estimates of les manifestants at les manifestations according to the unions vs. the police throughout France. An admirable goal for a newspaper, and done with some sensitivity to graphic principles, in a style reminiscent of les Albums: color-coding key (but no scale), time-series sub-graph, numbers included for those who want/need them.<br />
<br />
But what do we see, and is there anything we could see better by some other display or encoding of these numbers? Obviously, either the police in Paris can't count or deliberately misrepresent things, or the unions there have enormously dilated pupils. Is the over-count ratio greater in Paris than in Marseilles? Were the police in Mont de Marsan all en cong'e? Is Dijon the place with the closest correspondence between the estimates? Presumably, they used count ~ area (or sqrt radius); should they have used a different power for better perceptual comparison? What is missing from this display? e.g, how did the turnout in Toulouse compare to that in Nante or Rennes, taking population into account?<br />
best,<br />
-Michael</i></p>
<p>You can download the data <a href="http://www.rosuda.org/~theus/LeMonde.txt">here</a>. Any comments are welcome and of course, any attempt of a better visualization will be posted here!<br />
<br />
Have fun!</p>
<hr />
<p>Here are two posts:</p>
<p>Michael sent this graphics:<br />
<img src="http://rosuda.org/~theus/Blog/leMondeM.gif" width="275" height="300" /><br />
He wrote:<br />
<i>"Here is a simple log-log scatterplot that answers some of the questions posed originally. On a log scale, the relation is fairly linear, with a slope of 1.19, indicating that union estimates are on average about 154% of those by the police. I've labelled only those cities that have a log-residual greater than 0.15, and scaled the axes so that the line of equality is approx. 45 deg."</i></p>
<p>Romain's (updated) suggestion is:<br />
<a href="http://rosuda.org/~theus/Blog/cpe-2.png"><img src="http://rosuda.org/~theus/Blog/cpe-2.png" width="400" height="400" /></a><br />
(See also his comment!)</p>

]]></description>
			<content:encoded><![CDATA[<div>We got this from <a href="mailto:Pukelsheim@Math.Uni-Augsburg.De">Friedrich</a> who is just finishing a sabbatical in Paris:<br />
<a href="http://rosuda.org/~theus/Blog/leMonde.jpg"><img src="http://rosuda.org/~theus/Blog/leMondeSmall.jpg" /></a><br />
<a href="mailto:friendly@yorku.ca">Michael</a> came up with the idea of a challenge. Here&#8217;s what he wrote:</p>
<p><i>Here&#8217;s a graph you may find interesting, published yesterday in <a href="http://www.lemonde.fr/web/infog/0,47-0@2-3224,54-755523@51-725561,0.html">Le Monde</a>.<br />
The manifest goal is to compare estimates of les manifestants at les manifestations according to the unions vs. the police throughout France. An admirable goal for a newspaper, and done with some sensitivity to graphic principles, in a style reminiscent of les Albums: color-coding key (but no scale), time-series sub-graph, numbers included for those who want/need them.</p>
<p>But what do we see, and is there anything we could see better by some other display or encoding of these numbers? Obviously, either the police in Paris can&#8217;t count or deliberately misrepresent things, or the unions there have enormously dilated pupils. Is the over-count ratio greater in Paris than in Marseilles? Were the police in Mont de Marsan all en cong&#8217;e? Is Dijon the place with the closest correspondence between the estimates? Presumably, they used count ~ area (or sqrt radius); should they have used a different power for better perceptual comparison? What is missing from this display? e.g, how did the turnout in Toulouse compare to that in Nante or Rennes, taking population into account?<br />
best,<br />
-Michael</i></p>
<p>You can download the data <a href="http://www.rosuda.org/~theus/LeMonde.txt">here</a>. Any comments are welcome and of course, any attempt of a better visualization will be posted here!</p>
<p>Have fun!</p>
<hr />
<p>Here are two posts:</p>
<p>Michael sent this graphics:<br />
<img src="http://rosuda.org/~theus/Blog/leMondeM.gif" width="275" height="300" /><br />
He wrote:<br />
<i>&#8220;Here is a simple log-log scatterplot that answers some of the questions posed originally. On a log scale, the relation is fairly linear, with a slope of 1.19, indicating that union estimates are on average about 154% of those by the police. I&#8217;ve labelled only those cities that have a log-residual greater than 0.15, and scaled the axes so that the line of equality is approx. 45 deg.&#8221;</i></p>
<p>Romain&#8217;s (updated) suggestion is:<br />
<a href="http://rosuda.org/~theus/Blog/cpe-2.png"><img src="http://rosuda.org/~theus/Blog/cpe-2.png" width="400" height="400" /></a><br />
(See also his comment!)</p>
</div>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://statgraphics.blog.com/2006/03/29/minards-birthday-challenge/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
