<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Michelle Minkoff &#187; processing</title>
	<atom:link href="http://michelleminkoff.com/category/processing/feed/" rel="self" type="application/rss+xml" />
	<link>http://michelleminkoff.com</link>
	<description>=SUM (Passion + journalism + data + technology)</description>
	<lastBuildDate>Mon, 16 Jan 2012 20:20:23 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Is a flat text file or a database right for an app?</title>
		<link>http://michelleminkoff.com/2010/02/21/text-file-or-database/</link>
		<comments>http://michelleminkoff.com/2010/02/21/text-file-or-database/#comments</comments>
		<pubDate>Sun, 21 Feb 2010 18:08:01 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[CAR]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[aggregate]]></category>
		<category><![CDATA[application]]></category>
		<category><![CDATA[approaching the end]]></category>
		<category><![CDATA[art galleries]]></category>
		<category><![CDATA[bubble chart]]></category>
		<category><![CDATA[chicago artists]]></category>
		<category><![CDATA[city limits]]></category>
		<category><![CDATA[computer asssisted reporting]]></category>
		<category><![CDATA[current issue]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[flat text file]]></category>
		<category><![CDATA[independent study]]></category>
		<category><![CDATA[interface]]></category>
		<category><![CDATA[metro chicago area]]></category>
		<category><![CDATA[neighborhood]]></category>
		<category><![CDATA[paintings]]></category>
		<category><![CDATA[pamphlet]]></category>
		<category><![CDATA[principle]]></category>
		<category><![CDATA[sculptures]]></category>
		<category><![CDATA[sql statements]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[visualizing data]]></category>
		<category><![CDATA[zip code]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=692</guid>
		<description><![CDATA[I&#8217;ve been busily coding away on my final project for my data visualization independent study. I&#8217;m looking at data that I&#8217;ve been slowly acquiring on the life of art galleries that were in Chicago in 1990.  Back then, the Chicago Artists&#8217; Coalition had a complete listing in a pamphlet (more like a book) they put [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been busily coding away on my final project for my data visualization independent study. I&#8217;m looking at data that I&#8217;ve been slowly acquiring on the life of art galleries that were in Chicago in 1990.  Back then, the Chicago Artists&#8217; Coalition had a complete listing in a pamphlet (more like a book) they put out for artists.  It included galleries within the city, in the metro Chicago area, and even downstate.  I&#8217;m choosing to focus on ones that were within Chicago&#8217;s city limits in 1990, and I&#8217;m looking into what happened to them.  Ideally, the visualization will use the principle of a bubble chart to show how many galleries, of those 96, existed in 1990, 1995, 2000, 2005 and 2010.  You will also be able to filter by zip code, which sheds light on what neighborhood housed galleries that have lasted the longest, and perhaps more interestingly, by medium, showing how these statistics change if you look at galleries only specializing in paintings, or only in sculptures.<span id="more-692"></span></p>
<p>One part I&#8217;ve been slowly conquering is connecting my SQLite databse to the Processing application.  My current issue is closing the database, the query just runs and runs and runs and&#8230;you get the point.  I&#8217;m using a database because these counts need to be able to adjust depending on where you click, and I figure that SQL statements are the best way to filter and aggregate the various information.</p>
<p>But today, in my reading from Ben Fry&#8217;s Visualizing Data, which I&#8217;m approaching the end of, Fry was discussing acquiring data.  Many of the methods I&#8217;d come across before in my own research, they focused on how to hook up the database to my interface.  Fry suggested to reconsider whether you need a database, or can use programming to manipulate a flat text file.  With the way I&#8217;ve designed this application, I need to count up the number of records where a column meets a certain condition.  I was trying to do that with a count in SQL.  But I can just have the program do a loop, and count it programatically.</p>
<p>Perhaps using a database is a better fit when doing joins or complicated queries, or if one has millions, or even thousands, of records.  But if performing counts, and pulling records that meet certain conditions, is all that&#8217;s necessary, right now I&#8217;m thinking that a flat text file is best.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/striving-toward-visual-storytelling-not-just-presentation/" title="Visual confections are more than mere presentation">Visual confections are more than mere presentation</a></li><li>February 1, 2010 -- <a href="http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/" title="Relating zip codes and geography using Processing">Relating zip codes and geography using Processing</a></li><li>January 10, 2010 -- <a href="http://michelleminkoff.com/2010/01/10/which-states-have-been-hit-hardest-by-unemployment-in-the-last-decade/" title="Which states have been hit hardest by unemployment in the last decade?">Which states have been hit hardest by unemployment in the last decade?</a></li><li>March 3, 2010 -- <a href="http://michelleminkoff.com/2010/03/03/persistence-of-chicago-art-galleries/" title="Persistence of Chicago Art Galleries">Persistence of Chicago Art Galleries</a></li><li>February 28, 2010 -- <a href="http://michelleminkoff.com/2010/02/28/django-app-2-conquering-forms-and-the-google-api/" title="Django app #2: Conquering forms and Google Maps API">Django app #2: Conquering forms and Google Maps API</a></li><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/visualizing-networking-when-it-doesnt-work/" title="Visualizing networking: When it doesn&#8217;t work">Visualizing networking: When it doesn&#8217;t work</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/02/21/text-file-or-database/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Visualizing networking: When it doesn&#8217;t work</title>
		<link>http://michelleminkoff.com/2010/02/15/visualizing-networking-when-it-doesnt-work/</link>
		<comments>http://michelleminkoff.com/2010/02/15/visualizing-networking-when-it-doesnt-work/#comments</comments>
		<pubDate>Tue, 16 Feb 2010 03:10:17 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[class]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[audience]]></category>
		<category><![CDATA[both genders]]></category>
		<category><![CDATA[fail]]></category>
		<category><![CDATA[getting a job]]></category>
		<category><![CDATA[great potential]]></category>
		<category><![CDATA[huckleberry finn]]></category>
		<category><![CDATA[independent study]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[mark twain]]></category>
		<category><![CDATA[networking]]></category>
		<category><![CDATA[observation]]></category>
		<category><![CDATA[political agenda]]></category>
		<category><![CDATA[programming exercise]]></category>
		<category><![CDATA[s programming]]></category>
		<category><![CDATA[social networking]]></category>
		<category><![CDATA[story idea]]></category>
		<category><![CDATA[visualization technique]]></category>
		<category><![CDATA[visualizing data]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=633</guid>
		<description><![CDATA[So much in journalism and in life, we strive for perfection in our work. The best story idea, the best presentation.  We strive for perfection as individuals, we compete to be the best, get to the scene before someone else, write tighter, edit faster.  But sometimes, it’s just as important to recognize when something just [...]]]></description>
			<content:encoded><![CDATA[<p>So much in journalism and in life, we strive for perfection in our work. The best story idea, the best presentation.  We strive for perfection as individuals, we compete to be the best, get to the scene before someone else, write tighter, edit faster.  But sometimes, it’s just as important to recognize when something just didn’t work.  I think that’s still an achievement, the real problem is if you think succeeded, when in reality you have failed your audience. That was my lesson of the week.</p>
<p>Social networking analysis is a topic that my independent study advisor Rich Gordon and I have been discussing often during our weekly meetings and through materials he’s pointed me to, as well as my own reading and observation, I think it has some great potential in terms of journalistic applications.  Whether it’s about getting a job or pushing your political agenda, so much in this world is about who you know. <span id="more-633"></span></p>
<p>That’s why I was so excited that this week’s programming exercise in Ben Fry’s Visualizing Data book, which I’m using to learn Processing, focused on visualizing networks.  The chapter focused on showing these connections between adjacent words in the text, and proving how it was not an effective visualization technique.  I couldn’t agree more – this was the wrong application for it.</p>
<p>The book mapped the words in Mark Twain&#8217;s Huckleberry Finn, but because I don’t like replicating exact projects when I can avoid it, I decided to hop over to <a href="http://www.gutenberg.org/wiki/Main_Page" target="_blank">Project Gutenberg </a>and find something more interesting.  Upon searching for journalism, because why not, I found a book on “<a href="http://www.gutenberg.org/dirs/etext05/8jrnw10h.htm" target="_blank">Journalism and Women</a>,” written by a man.  This piece is worth a post of its own, in trying to supposedly help women.  Parts are condescending, and some advice is more relevant today than ever, but for both genders.  Today&#8217;s subject isn&#8217;t the book&#8217;s content as a whole, but the connections between words to see if that helped to illuminate certain points.  Hint: It didn&#8217;t, because this was a failed experiment.</p>
<p>But I learned the skills of how to use a network map to visualize connections, which could be valuable in the future.  It&#8217;s really difficult to make sense of thousands, and even hundreds of nodes.  I think this type of visualization demands a strong interactive component so the user can make sense of what’s happening, and a filtering mechanism would be even better.</p>
<p>As I was writing this, Rich sent me a <a href="http://slatest.slate.com/features/news_dots/default.htm" target="_blank">link</a> to a Slate visualization of how news stories connect to each other &#8212; &#8220;News Dots&#8221;. Great content, and the interface works, using different colors for different categories, having many different levels of filtering and exploration.  You can even go back and look at other days.  The more you roll over, the more you click, the more you want to explore, but I can understand what’s important at a glance.  I just got sidetracked for 20 minutes in deep exploration. Which is the mark of good work, in my eyes.</p>
<p>Back to my visualization.  Here’s the <a href="http://michelleminkoff.com/women_journalism/output.pdf" target="_blank">final product</a>, which I coded to be output as a PDF, it looks like a jumbled mess.  The bigger the bubble, the more time the word appears.  And there&#8217;s still so many common words with no meaning that the visualization is started to represent how many times &#8220;than&#8221; was used, utterly unhelpful.  (There’s 126 pages worth of connections by the way, when you force the program to output them, and that’s eliminating duplicate nodes.)  A user can’t get much out of this, no longer how long he or she stares at it, other than the fact that there are a lot of complicated connections.</p>
<p>The book then instructed us to try to make more sense of the information by bringing it into the open source <a href="http://www.graphviz.org/" target="_blank">Graphviz</a> program. It looks at and sorts the nodes hierarchically.  The program froze on my 126 pages of connections.  When I limited it to 10 pages, the first two chapters of my book, it produced <a href="http://www.michelleminkoff.com/womenjourn2.gif" target="_blank">this rather complicated gif file</a> &#8212; which has a significantly long load time.</p>
<p>It makes a little more sense, but frankly not much, but at least you can see the individual nodes. And if you zoom in, you can scroll around, which is an interesting way to provide interactivity within a very large static graphic.  But this is nowhere near ready for professional publication.</p>
<p>This section has taught me a lot about the importance of optimizing complicated visualizations, probably by coding it so only some parts load at once.</p>
<p>The chapter also took a look at using networking the right way by looking at a Web server and instead of individual nodes letting the data create a shape. The shape depicts a network, but without the same amount of detail. It works because it doesn’t overwhelm the reader.  But being as I have no Web server log of my own – I have to dig into the innards of my GoDaddy site, or I might be able to do it with my Django server once I get a project up and running in the coming week or so – I opted to treat it as an interesting reading exercise, but not one to put into practice, for now anyway.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/striving-toward-visual-storytelling-not-just-presentation/" title="Visual confections are more than mere presentation">Visual confections are more than mere presentation</a></li><li>February 12, 2010 -- <a href="http://michelleminkoff.com/2010/02/12/data-delver-william-hartnett-palm-beach-post/" title="Data Delver: William Hartnett, Palm Beach Post">Data Delver: William Hartnett, Palm Beach Post</a></li><li>January 11, 2010 -- <a href="http://michelleminkoff.com/2010/01/11/importance-of-combining-data-analysis-with-context-reflections-on-readings-from-week-two/" title="Importance of combining data analysis with context (reflections on readings from week two)">Importance of combining data analysis with context (reflections on readings from week two)</a></li><li>March 8, 2010 -- <a href="http://michelleminkoff.com/2010/03/08/data-delver-jennifer-lafleur-propublica/" title="Data Delver: Jennifer LaFleur, ProPublica">Data Delver: Jennifer LaFleur, ProPublica</a></li><li>March 2, 2010 -- <a href="http://michelleminkoff.com/2010/03/02/guest-post-better-integrating-data-in-our-newsroom-culture/" title="Guest Post: Integrating data with our journalism">Guest Post: Integrating data with our journalism</a></li><li>February 21, 2010 -- <a href="http://michelleminkoff.com/2010/02/21/text-file-or-database/" title="Is a flat text file or a database right for an app?">Is a flat text file or a database right for an app?</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/02/15/visualizing-networking-when-it-doesnt-work/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Treemapping Gov. Quinn&#8217;s State of the State</title>
		<link>http://michelleminkoff.com/2010/02/09/treemapping-gov-quinns-state-of-the-state/</link>
		<comments>http://michelleminkoff.com/2010/02/09/treemapping-gov-quinns-state-of-the-state/#comments</comments>
		<pubDate>Tue, 09 Feb 2010 16:04:19 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[class]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[boxes]]></category>
		<category><![CDATA[budget]]></category>
		<category><![CDATA[dynamic project]]></category>
		<category><![CDATA[eyeballs]]></category>
		<category><![CDATA[governor]]></category>
		<category><![CDATA[hierarchies]]></category>
		<category><![CDATA[illinois]]></category>
		<category><![CDATA[journalism]]></category>
		<category><![CDATA[map]]></category>
		<category><![CDATA[ny times]]></category>
		<category><![CDATA[pat quinn]]></category>
		<category><![CDATA[reporting tool]]></category>
		<category><![CDATA[squares]]></category>
		<category><![CDATA[state of the state address]]></category>
		<category><![CDATA[treemap]]></category>
		<category><![CDATA[trees]]></category>
		<category><![CDATA[unstructured text]]></category>
		<category><![CDATA[zoom]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=540</guid>
		<description><![CDATA[This week in the Processing book, I learned all about trees and hierarchies. There&#8217;s a lot of potential here for allowing the user to delve deeper into interactives by providing multiple layers. This is a very cool example of how programming helps support my theory of journalism &#8212; the deeper the information you offer, the [...]]]></description>
			<content:encoded><![CDATA[<p>This week in the Processing book, I learned all about trees and hierarchies.  There&#8217;s a lot of potential here for allowing the user to delve deeper into interactives by providing multiple layers.  This is a very cool example of how programming helps support my theory of journalism &#8212; the deeper the information you offer, the more the user has to explore. You&#8217;re conveying knowledge, and if you&#8217;re holding interest, you&#8217;re keeping eyeballs on the site. But treemapping is most valuable as a reporting tool, I think, making structured data out of unstructured text.</p>
<p>This <a href="http://michelleminkoff.com/state_of_the_state/applet/index.html">example project</a> is a static map of Gov. Quinn&#8217;s State of the State <a href="http://www.google.com/url?sa=t&#038;source=web&#038;ct=html&#038;cd=1&#038;ved=undefined&#038;url=http%3A%2F%2F74.125.95.132%2Fsearch%3Fq%3Dcache%3Aq_y9SryfCvgJ%3Awww.illinois.gov%2Fpublicincludes%2Fstatehome%2Fgov%2Fdocuments%2F2010%252520SOS%252520Transcript.PDF%2Bquinn%2Bstate%2Bof%2Bthe%2Bstate%2Btranscript%26cd%3D1%26hl%3Den%26ct%3Dclnk%26gl%3Dus%26client%3Dfirefox-a&#038;ei=z4BxS9XVHovKNbzTzO0J&#038;usg=AFQjCNF81krBpWBro2A0C_JUi3lhR6ErBg&#038;sig2=gW1-8jGZEuzW93VvdX6Frg">address</a>.  The bigger the square containing the word, the more times it appeared.  I told the program not to include &#8220;the,&#8221; &#8220;an,&#8221; &#8220;a&#8221;, or &#8220;of&#8221;, because then it&#8217;s just a map of large squares of articles.  But the more articles you tell it to delete, the more you find more of them.  Note to self: Make a list of all articles to exclude from this sort of thing in the future.  </p>
<p>I thought it was interesting how few times the word &#8220;state&#8221; actually came up in the speech.  It&#8217;s important to remember that even when we see words contained within small boxes, they are still fairly significant.  The insignificantly small words are those contained in boxes that are too small to contain letters.</p>
<p>It would be an even better project if you could click on the small boxes to zoom in, in the style of that <a href="http://www.nytimes.com/interactive/2010/02/01/us/budget.html">fabulous NY Times budget interactive</a> from a little while back.</p>
<p>I also made a dynamic treemap that details the structure of a folder on your file system. The dynamic project is of course more interesting, but the way it works, it depends on being loaded on your computer, so I&#8217;m holding off from posting it online.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>February 1, 2010 -- <a href="http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/" title="Relating zip codes and geography using Processing">Relating zip codes and geography using Processing</a></li><li>February 13, 2010 -- <a href="http://michelleminkoff.com/2010/02/13/mo-tamman-wall-street-journal/" title="Data Delver: Mo Tamman, Wall Street Journal">Data Delver: Mo Tamman, Wall Street Journal</a></li><li>May 3, 2010 -- <a href="http://michelleminkoff.com/2010/05/03/real-world-coding-lessons/" title="Note to self: Real world journo-coding lessons">Note to self: Real world journo-coding lessons</a></li><li>April 7, 2010 -- <a href="http://michelleminkoff.com/2010/04/07/columbias-joint-ms-good-start-panacea/" title="Columbia&#8217;s new joint MS: Good start, but no panacea">Columbia&#8217;s new joint MS: Good start, but no panacea</a></li><li>March 29, 2010 -- <a href="http://michelleminkoff.com/2010/03/29/using-javascript-for-interactive-google-charts/" title="Using Javascript for interactive Google charts">Using Javascript for interactive Google charts</a></li><li>March 28, 2010 -- <a href="http://michelleminkoff.com/2010/03/28/data-delver-mark-schaver-louisville-courier/" title="Data Delver: Mark Schaver, Louisville Courier">Data Delver: Mark Schaver, Louisville Courier</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/02/09/treemapping-gov-quinns-state-of-the-state/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Relating zip codes and geography using Processing</title>
		<link>http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/</link>
		<comments>http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/#comments</comments>
		<pubDate>Mon, 01 Feb 2010 23:54:08 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[class]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[fry]]></category>
		<category><![CDATA[interactive features]]></category>
		<category><![CDATA[journalists]]></category>
		<category><![CDATA[map]]></category>
		<category><![CDATA[note to self]]></category>
		<category><![CDATA[nuances]]></category>
		<category><![CDATA[population density map]]></category>
		<category><![CDATA[precise location]]></category>
		<category><![CDATA[reporting tool]]></category>
		<category><![CDATA[scatterplot]]></category>
		<category><![CDATA[spreadsheet]]></category>
		<category><![CDATA[t news]]></category>
		<category><![CDATA[visualization]]></category>
		<category><![CDATA[visualizing data]]></category>
		<category><![CDATA[zip code]]></category>
		<category><![CDATA[zip codes]]></category>
		<category><![CDATA[zipdecode]]></category>
		<category><![CDATA[zoom]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=510</guid>
		<description><![CDATA[This week I tackled recreating Ben Fry&#8217;s Zipdecode project, which he gives great step-by-step instructions for in his Visualizing Data book that I have been following along with this quarter. It&#8217;s an interesting take on the concept of the scatterplot, even before using its interactive features, it asserts its usefulness as a population density map. [...]]]></description>
			<content:encoded><![CDATA[<p>This week I tackled recreating Ben Fry&#8217;s <a href="http://benfry.com/zipdecode/">Zipdecode</a> project, which he gives great step-by-step instructions for in his Visualizing Data book that I have been following along with this quarter.</p>
<p>It&#8217;s an interesting take on the concept of the scatterplot, even before using its interactive features, it asserts its usefulness as a population density map.  Each dot represents a different zip code, and the more zip codes an area requires, the greater its population. This visualization isn&#8217;t news per se, but could be a valuable reporting tool, helping journalists to understand how visualizations are geographically related, and helping them find the precise location of where a certain source is located.<span id="more-510"></span></p>
<p>I take no creative credit for the project that emerged below.  But I did learn a few things about the Processing language and coding that I thought might be useful to note.  Probably more useful as a note to self than anything else, but here goes.</p>
<ol>
<li>If you&#8217;re unable to find a class, you may think it&#8217;s because you are missing a bracket within that class.  But you could be missing a bracket in another class, which could affect other code.  It took me much longer than it should have to figure that out.</li>
<li>You can clean a spreadsheet or database efficiently by using a program to adjust small nuances.  But sometimes the time it takes to code is longer than it would take for you to do it in a non-programatic way.  If the question is making it easier for the computer to parse, then you should let a program make that change. If it&#8217;s a question of making it easier on your eyes, there may be times when that isn&#8217;t as necessary.  Or maybe that&#8217;s just me.</li>
<li>Little nuances can take a lot longer than you think.  I kept fussing with the placement of textual instructions, wondering if it was clear enough where to click and when.  Sometimes simple is better, I think just having the word &#8220;Zoom&#8221; for people to click on is clearer than a three-level sliding scale in this case.</li>
<li>Embedding Processing applets in WordPress is buggier than it should be.  I&#8217;m unhappy that the visualization displays as a big gray box.  But I couldn&#8217;t see a way to put text on that gray box saying &#8220;Click in this box to begin,&#8221; because that blank box doesn&#8217;t appear when I run the visualization directly on my computer, it just appears as it should.  It&#8217;s a little better in HTML, so I&#8217;ll be linking instead of embedding from here forward. You can find today&#8217;s visualization <a href="http://michelleminkoff.com/minkoff_zipcode/applet/zipcode.html" target="_blank">here</a>.</li>
</ol>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>February 21, 2010 -- <a href="http://michelleminkoff.com/2010/02/21/text-file-or-database/" title="Is a flat text file or a database right for an app?">Is a flat text file or a database right for an app?</a></li><li>February 9, 2010 -- <a href="http://michelleminkoff.com/2010/02/09/treemapping-gov-quinns-state-of-the-state/" title="Treemapping Gov. Quinn&#8217;s State of the State">Treemapping Gov. Quinn&#8217;s State of the State</a></li><li>January 10, 2010 -- <a href="http://michelleminkoff.com/2010/01/10/which-states-have-been-hit-hardest-by-unemployment-in-the-last-decade/" title="Which states have been hit hardest by unemployment in the last decade?">Which states have been hit hardest by unemployment in the last decade?</a></li><li>February 28, 2010 -- <a href="http://michelleminkoff.com/2010/02/28/django-app-2-conquering-forms-and-the-google-api/" title="Django app #2: Conquering forms and Google Maps API">Django app #2: Conquering forms and Google Maps API</a></li><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/data-delver-lisa-pickoff-white/" title="Data Delver: Lisa Pickoff-White, California Watch">Data Delver: Lisa Pickoff-White, California Watch</a></li><li>February 13, 2010 -- <a href="http://michelleminkoff.com/2010/02/13/mo-tamman-wall-street-journal/" title="Data Delver: Mo Tamman, Wall Street Journal">Data Delver: Mo Tamman, Wall Street Journal</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Changes in the numbers of students majoring in programming and social sciences</title>
		<link>http://michelleminkoff.com/2010/01/19/majors/</link>
		<comments>http://michelleminkoff.com/2010/01/19/majors/#comments</comments>
		<pubDate>Tue, 19 Jan 2010 12:13:11 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[CAR]]></category>
		<category><![CDATA[class]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[argonne]]></category>
		<category><![CDATA[car data]]></category>
		<category><![CDATA[comp sci majors]]></category>
		<category><![CDATA[computer assisted reporting]]></category>
		<category><![CDATA[computer science degrees]]></category>
		<category><![CDATA[computer scientist]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[distinction]]></category>
		<category><![CDATA[future project]]></category>
		<category><![CDATA[gender breakdown]]></category>
		<category><![CDATA[gender race]]></category>
		<category><![CDATA[getting computer]]></category>
		<category><![CDATA[journalist]]></category>
		<category><![CDATA[journalists]]></category>
		<category><![CDATA[national science foundation]]></category>
		<category><![CDATA[point questions]]></category>
		<category><![CDATA[programmer]]></category>
		<category><![CDATA[school of journalism]]></category>
		<category><![CDATA[social science degrees]]></category>
		<category><![CDATA[traditional path]]></category>
		<category><![CDATA[wheels]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=376</guid>
		<description><![CDATA[What is a traditional path to programming nowadays? It&#8217;s a question I&#8217;ve been thinking about a lot, esp. in the realm of the journalist-programmer. So many people from the older school of journalism came through using databases as tools to help with reporting, not because they took a class in it. That&#8217;s certainly encouraging for [...]]]></description>
			<content:encoded><![CDATA[<p>What is a traditional path to programming nowadays?  It&#8217;s a question I&#8217;ve been thinking about a lot, esp. in the realm of the journalist-programmer.  So many people from the older school of journalism came through using databases as tools to help with reporting, not because they took a class in it.  That&#8217;s certainly encouraging for someone like me.</p>
<p>But that started the wheels turning in my brain &#8212; Are people still getting computer science degrees?  Is the major more prevalent now than 5 years ago? 20 years ago?  How do the number of comp. sci. majors compare to..social science, let&#8217;s say, another field that dovetails nicely with CAR (and yet another one I didn&#8217;t major in, but that&#8217;s beside the point.)</p>
<p><APPLET name="sketch_081001a" WIDTH="700" HEIGHT="400" archive="http://www.michelleminkoff.com/minkoff_time_series_1/applet/minkoff_time_series_1.jar" standby="Loading Processing software…" codebase="http://www.michelleminkoff.com/minkoff_time_series_1/" code="minkoff_time_series_1" mayscript="true" scriptable="true" image="http://pappmaskin.no/opensource/processing/sketch_081001a/loading.gif" boxmessage="Loading Processing software…" boxcolor="#FFFFFF"><IMG SRC="http://pappmaskin.no/opensource/processing/sketch_081001a/loading.gif" WIDTH="1" HEIGHT="1" /></APPLET> </p>
<p><span id="more-376"></span></p>
<p>Questions like these are always good candidates for my weekly Processing projects.  This week &#8212; time-related data!  I went to the National Science Foundation, where they track the student getting various degrees in various fields.  It&#8217;s broken down in lots of interesting ways, across gender, race, state, field of interest in the sciences, etc.  A future project is the gender breakdown &#8212; where are all the female CAR/data/programmer journalists?  I know a few, but still&#8230;we&#8217;re in my minority.  It&#8217;s not surprising, I remember my own experiences growing up as the daughter of an Argonne computer scientist, and there weren&#8217;t many women around that building in the &#8217;90s either, and there still aren&#8217;t as many as I would like to see.</p>
<p>Back to the point&#8230;.For this project, I decided to look at the <a href="http://www.nsf.gov/statistics/nsf08321/content.cfm?pub_id=3785&amp;id=2">number of computer science degrees granted nationwide, across 40 years</a>, from 1966 to 2006 (the most recent data available).  I compared this to the same numbers for engineering and social science.  The idea being to compare CS to one very technical applied field, and one more theoretical one, but all related to data in some capacity.  I was amazed to see just how much more popular social science degrees were across the board, I hadn&#8217;t realized the distinction was that stark.</p>
<p>I also took note of the fact that the number of computer science degrees were higher in 2006 than ever before, but not that much higher than the dot-com boom in the late 1990s.  But overall, there&#8217;s an upward trend.  Not everyone&#8217;s going the self-taught route, although I imagine those with degrees must still self-teach, because you can&#8217;t possibly be taught every language that will be invented during your working life.</p>
<p>In terms of creating the actual project itself, I struggled with getting the data in an acceptable format for Processing.  Excel was spitting out a text-delimited file, which is apparently distinct from a text-separated values file.  Nuances, nuances.  And then, I left the commas in the middle of numbers. It looked right to my eye, but there was no way a computer was going to understand that as a float.  Note for next time!</p>
<p>I&#8217;m not completely happy with this final product.  The code is taken almost verbatim from Ben Fry&#8217;s &#8220;Visualizing Data&#8221; Processing book (which I highly recommend).  And for this type of comparison, I would have preferred a way to place all three graphs next to each other, so you could look at the data for engineering and CS degrees side-by-side.  The y axis label is also a bit close to the numbers.</p>
<p>But what I am proud of is that I was able to pull off a tabbed graph.  This is what the lovely Tribune graphics people did for our <a href="http://www.chicagotribune.com/health/chi-091218-lobbyingdatabase-html,0,4298069.htmlpage">joint health care lobbying project</a>, and at the time I remember thinking that it must be really tough to do.  Now, I understand how it&#8217;s done, and was able to do it with a book holding my hand while I did.  Next time, I&#8217;ll be even more self-sufficient.</p>
<p>If anyone wants encouragement for embarking on the journey of learning programming, I would argue this is a darn good reason. That feeling of satisfaction when you can make things happen, and truly become master of the machine, is simply amazing.  What seems like magic, and impossible, becomes real and tangible.  It&#8217;s just another tool in your arsenal.  And then you use your journalistic creativity to think of how best to use it.  How cool is that?</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>January 10, 2010 -- <a href="http://michelleminkoff.com/2010/01/10/which-states-have-been-hit-hardest-by-unemployment-in-the-last-decade/" title="Which states have been hit hardest by unemployment in the last decade?">Which states have been hit hardest by unemployment in the last decade?</a></li><li>March 8, 2010 -- <a href="http://michelleminkoff.com/2010/03/08/data-delver-jennifer-lafleur-propublica/" title="Data Delver: Jennifer LaFleur, ProPublica">Data Delver: Jennifer LaFleur, ProPublica</a></li><li>March 2, 2010 -- <a href="http://michelleminkoff.com/2010/03/02/guest-post-better-integrating-data-in-our-newsroom-culture/" title="Guest Post: Integrating data with our journalism">Guest Post: Integrating data with our journalism</a></li><li>February 12, 2010 -- <a href="http://michelleminkoff.com/2010/02/12/data-delver-william-hartnett-palm-beach-post/" title="Data Delver: William Hartnett, Palm Beach Post">Data Delver: William Hartnett, Palm Beach Post</a></li><li>January 12, 2010 -- <a href="http://michelleminkoff.com/2010/01/12/data-delver-ted-mellnik-charlotte-observer-database-editor/" title="Data Delver: Ted Mellnik, Charlotte Observer database editor">Data Delver: Ted Mellnik, Charlotte Observer database editor</a></li><li>January 11, 2010 -- <a href="http://michelleminkoff.com/2010/01/11/importance-of-combining-data-analysis-with-context-reflections-on-readings-from-week-two/" title="Importance of combining data analysis with context (reflections on readings from week two)">Importance of combining data analysis with context (reflections on readings from week two)</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/01/19/majors/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Reflections on Visualization Theory (Data viz readings, week 1)</title>
		<link>http://michelleminkoff.com/2010/01/04/viz-week-1/</link>
		<comments>http://michelleminkoff.com/2010/01/04/viz-week-1/#comments</comments>
		<pubDate>Mon, 04 Jan 2010 23:32:33 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[class]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[tufte]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=136</guid>
		<description><![CDATA[In this first set of reading, I learn that the principles of simplicity, accuracy and more are as true in data visualization as they are in a text story. Edward Tufte, Visual Explanations, &#8220;Images and Quantities&#8221; In Tufte&#8217;s first paragraph of Visual Explanations, he discusses the importance of readability &#8212; a concept I see as [...]]]></description>
			<content:encoded><![CDATA[<p>In this first set of reading, I learn that the principles of simplicity, accuracy and more are as true in data visualization as they are in a text story.</p>
<p><strong>Edward Tufte, <em>Visual Explanations</em>, &#8220;Images and Quantities&#8221;</strong></p>
<p>In Tufte&#8217;s first paragraph of Visual Explanations, he discusses the importance of readability &#8212; a concept I see as having a strong parallel to the usability so often discussed in a more modern era.  He breaks down it down into three types of depicting quantities: direct labels, encodings (scales of color) and self-representing scales. I like his breakdown, as I&#8217;ve seen examples of all of theses, whether on news sites, or t0 illustrate points in scientific journals when I was doing medical reporting.  I attempted to do an encoding on my recent border crossing graph, but was unhappy that using color intensity to express data made everything so light that it was difficult to distinguish between colors. I would add the caution that encodings are best used when the entire graph is just representing various intensities of that one variable.</p>
<p>Tufte also writes that maps are just another type of graph.  Statistical graphics are those that don&#8217;t just give the data, but spatially arrange the data on a straight one-dimensional line.   No comment here, except to say that so far everything makes logical sense.  And the main takeaway seems to be a maxim true in written journalism as well, &#8220;Keep it simple, stupid.&#8221;</p>
<p>At the same time, we should make sure to include all the facts and necessary details, which means including labels when they are not self-explanatory.  Tufte critiques a computer visualization for looking pretty, but not putting its data into the proper context.  Another maxim: Content is king.  Without interesting information, the coolest icons, colors and animations do nothing.  It&#8217;s nice to hear that it&#8217;s a professional point, but before I would have just clicked off of something I didn&#8217;t understand.  And in the news business, or any business, that&#8217;s not something we want.<span id="more-136"></span></p>
<p><strong>Ben Fry, <em>Visualizing Data</em>, &#8220;The Seven Stages of Visualizing Data&#8221;</strong></p>
<p>Here&#8217;s Fry&#8217;s list:</p>
<ol>
<li>Acquire</li>
<li>Parse</li>
<li>Filter</li>
<li>Mine</li>
<li>Represent</li>
<li>Refine</li>
<li>Interact</li>
</ol>
<p>All of these could probably be applied to writing as well.  I think it&#8217;s interesting that the first six of these elements are equivalent to reporting, and figuring out which quotes to use.  It strikes me that in both text and graphic work the actual creation is the end-step, but if you&#8217;ve done the previous work well, the piece should just write (or draw, or code) itself.  Fascinating.</p>
<p>One part that&#8217;s different about interactive data visualizations is the user&#8217;s ability to manipulate. This is a major focus of Fry&#8217;s introductory chapter.  Instead of guiding them through a story by taking them where you want them to go (they read your words in your order), here it&#8217;s your job to set up so they can make their own discoveries.  Creating visualizations for the Internet is all about putting the power back in the hands of the user.  Sometimes with text, I feel it&#8217;s a bit too easy to become a bit power-happy, after all, you&#8217;re telling the reader what they need to know.  Part of what I love about interactivity is that it&#8217;s democracy at its finest.  User, here&#8217;s the data so you can understand it, but there&#8217;s a nearly infinite number of ways you can explore it. Have it, and revel in the true sense of democracy.  That&#8217;s not withstanding the issue of whether the Internet is really democratic because not everyone has access, but that&#8217;s a topic for another time. Letting users drive our stories to a greater extent brings us back to why journalism was started, and its relevance goes beyond interactive visualizations, and provides a lesson I take to heart as a journalist, no matter what platform I&#8217;m using.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>March 8, 2010 -- <a href="http://michelleminkoff.com/2010/03/08/piece-de-resistance-data-viz-wrapup/" title="Piece de resistance: Data viz wrapup">Piece de resistance: Data viz wrapup</a></li><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/striving-toward-visual-storytelling-not-just-presentation/" title="Visual confections are more than mere presentation">Visual confections are more than mere presentation</a></li><li>January 19, 2010 -- <a href="http://michelleminkoff.com/2010/01/19/majors/" title="Changes in the numbers of students majoring in programming and social sciences">Changes in the numbers of students majoring in programming and social sciences</a></li><li>January 11, 2010 -- <a href="http://michelleminkoff.com/2010/01/11/importance-of-combining-data-analysis-with-context-reflections-on-readings-from-week-two/" title="Importance of combining data analysis with context (reflections on readings from week two)">Importance of combining data analysis with context (reflections on readings from week two)</a></li><li>January 10, 2010 -- <a href="http://michelleminkoff.com/2010/01/10/which-states-have-been-hit-hardest-by-unemployment-in-the-last-decade/" title="Which states have been hit hardest by unemployment in the last decade?">Which states have been hit hardest by unemployment in the last decade?</a></li><li>February 9, 2010 -- <a href="http://michelleminkoff.com/2010/02/09/repetition-repetition-the-power-of-multiples/" title="Repetition, repetition: The power of multiples">Repetition, repetition: The power of multiples</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/01/04/viz-week-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

