<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Michelle Minkoff &#187; sql</title>
	<atom:link href="http://michelleminkoff.com/category/sql/feed/" rel="self" type="application/rss+xml" />
	<link>http://michelleminkoff.com</link>
	<description>=SUM (Passion + journalism + data + technology)</description>
	<lastBuildDate>Wed, 16 May 2012 16:29:33 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Is a flat text file or a database right for an app?</title>
		<link>http://michelleminkoff.com/2010/02/21/text-file-or-database/</link>
		<comments>http://michelleminkoff.com/2010/02/21/text-file-or-database/#comments</comments>
		<pubDate>Sun, 21 Feb 2010 18:08:01 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[CAR]]></category>
		<category><![CDATA[data visualizations]]></category>
		<category><![CDATA[processing]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[aggregate]]></category>
		<category><![CDATA[application]]></category>
		<category><![CDATA[approaching the end]]></category>
		<category><![CDATA[art galleries]]></category>
		<category><![CDATA[bubble chart]]></category>
		<category><![CDATA[chicago artists]]></category>
		<category><![CDATA[city limits]]></category>
		<category><![CDATA[computer asssisted reporting]]></category>
		<category><![CDATA[current issue]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[flat text file]]></category>
		<category><![CDATA[independent study]]></category>
		<category><![CDATA[interface]]></category>
		<category><![CDATA[metro chicago area]]></category>
		<category><![CDATA[neighborhood]]></category>
		<category><![CDATA[paintings]]></category>
		<category><![CDATA[pamphlet]]></category>
		<category><![CDATA[principle]]></category>
		<category><![CDATA[sculptures]]></category>
		<category><![CDATA[sql statements]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[text]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[visualizing data]]></category>
		<category><![CDATA[zip code]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=692</guid>
		<description><![CDATA[I&#8217;ve been busily coding away on my final project for my data visualization independent study. I&#8217;m looking at data that I&#8217;ve been slowly acquiring on the life of art galleries that were in Chicago in 1990.  Back then, the Chicago Artists&#8217; Coalition had a complete listing in a pamphlet (more like a book) they put [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been busily coding away on my final project for my data visualization independent study. I&#8217;m looking at data that I&#8217;ve been slowly acquiring on the life of art galleries that were in Chicago in 1990.  Back then, the Chicago Artists&#8217; Coalition had a complete listing in a pamphlet (more like a book) they put out for artists.  It included galleries within the city, in the metro Chicago area, and even downstate.  I&#8217;m choosing to focus on ones that were within Chicago&#8217;s city limits in 1990, and I&#8217;m looking into what happened to them.  Ideally, the visualization will use the principle of a bubble chart to show how many galleries, of those 96, existed in 1990, 1995, 2000, 2005 and 2010.  You will also be able to filter by zip code, which sheds light on what neighborhood housed galleries that have lasted the longest, and perhaps more interestingly, by medium, showing how these statistics change if you look at galleries only specializing in paintings, or only in sculptures.<span id="more-692"></span></p>
<p>One part I&#8217;ve been slowly conquering is connecting my SQLite databse to the Processing application.  My current issue is closing the database, the query just runs and runs and runs and&#8230;you get the point.  I&#8217;m using a database because these counts need to be able to adjust depending on where you click, and I figure that SQL statements are the best way to filter and aggregate the various information.</p>
<p>But today, in my reading from Ben Fry&#8217;s Visualizing Data, which I&#8217;m approaching the end of, Fry was discussing acquiring data.  Many of the methods I&#8217;d come across before in my own research, they focused on how to hook up the database to my interface.  Fry suggested to reconsider whether you need a database, or can use programming to manipulate a flat text file.  With the way I&#8217;ve designed this application, I need to count up the number of records where a column meets a certain condition.  I was trying to do that with a count in SQL.  But I can just have the program do a loop, and count it programatically.</p>
<p>Perhaps using a database is a better fit when doing joins or complicated queries, or if one has millions, or even thousands, of records.  But if performing counts, and pulling records that meet certain conditions, is all that&#8217;s necessary, right now I&#8217;m thinking that a flat text file is best.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/striving-toward-visual-storytelling-not-just-presentation/" title="Visual confections are more than mere presentation">Visual confections are more than mere presentation</a></li><li>February 1, 2010 -- <a href="http://michelleminkoff.com/2010/02/01/relating-zip-codes-and-geography-using-processing/" title="Relating zip codes and geography using Processing">Relating zip codes and geography using Processing</a></li><li>January 10, 2010 -- <a href="http://michelleminkoff.com/2010/01/10/which-states-have-been-hit-hardest-by-unemployment-in-the-last-decade/" title="Which states have been hit hardest by unemployment in the last decade?">Which states have been hit hardest by unemployment in the last decade?</a></li><li>March 3, 2010 -- <a href="http://michelleminkoff.com/2010/03/03/persistence-of-chicago-art-galleries/" title="Persistence of Chicago Art Galleries">Persistence of Chicago Art Galleries</a></li><li>February 28, 2010 -- <a href="http://michelleminkoff.com/2010/02/28/django-app-2-conquering-forms-and-the-google-api/" title="Django app #2: Conquering forms and Google Maps API">Django app #2: Conquering forms and Google Maps API</a></li><li>February 15, 2010 -- <a href="http://michelleminkoff.com/2010/02/15/visualizing-networking-when-it-doesnt-work/" title="Visualizing networking: When it doesn&#8217;t work">Visualizing networking: When it doesn&#8217;t work</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/02/21/text-file-or-database/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to &#8220;Group By&#8221; in Excel</title>
		<link>http://michelleminkoff.com/2010/01/20/how-to-group-by-in-excel/</link>
		<comments>http://michelleminkoff.com/2010/01/20/how-to-group-by-in-excel/#comments</comments>
		<pubDate>Thu, 21 Jan 2010 00:17:05 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[CAR]]></category>
		<category><![CDATA[excel]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[group by]]></category>
		<category><![CDATA[random time]]></category>
		<category><![CDATA[unique records]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=402</guid>
		<description><![CDATA[Random time-saving Excel tip.  Want to condense a long column to only display unique entries?  Go to the Data menu, select Filter and Advanced Filter.  Voila, a check box labeled &#8220;Unique records only&#8221;.  Don&#8217;t know why I never noticed this before. Or, you know, you could use Group By in SQL.  But no need to [...]]]></description>
			<content:encoded><![CDATA[<p>Random time-saving Excel tip.  Want to condense a long column to only display unique entries?  Go to the Data menu, select Filter and Advanced Filter.  Voila, a check box labeled &#8220;Unique records only&#8221;.  Don&#8217;t know why I never noticed this before.</p>
<p>Or, you know, you could use Group By in SQL.  But no need to import databases just for filtering.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>April 11, 2010 -- <a href="http://michelleminkoff.com/2010/04/11/data-delver-paul-monies-oklahoman/" title="Data Delver: Paul Monies, The Oklahoman">Data Delver: Paul Monies, The Oklahoman</a></li><li>March 25, 2010 -- <a href="http://michelleminkoff.com/2010/03/25/self-teaching-data-and-programming-skills/" title="Self-teaching data and programming skills">Self-teaching data and programming skills</a></li><li>March 8, 2010 -- <a href="http://michelleminkoff.com/2010/03/08/data-delver-jennifer-lafleur-propublica/" title="Data Delver: Jennifer LaFleur, ProPublica">Data Delver: Jennifer LaFleur, ProPublica</a></li><li>February 28, 2010 -- <a href="http://michelleminkoff.com/2010/02/28/django-app-2-conquering-forms-and-the-google-api/" title="Django app #2: Conquering forms and Google Maps API">Django app #2: Conquering forms and Google Maps API</a></li><li>February 21, 2010 -- <a href="http://michelleminkoff.com/2010/02/21/text-file-or-database/" title="Is a flat text file or a database right for an app?">Is a flat text file or a database right for an app?</a></li><li>February 2, 2010 -- <a href="http://michelleminkoff.com/2010/02/02/recognize-the-importance-of-journalistic-creativity/" title="Recognize the importance of journalistic creativity">Recognize the importance of journalistic creativity</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/01/20/how-to-group-by-in-excel/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Collaborating with computers to parse &#8220;big data&#8221;</title>
		<link>http://michelleminkoff.com/2010/01/18/bigdata/</link>
		<comments>http://michelleminkoff.com/2010/01/18/bigdata/#comments</comments>
		<pubDate>Tue, 19 Jan 2010 04:47:32 +0000</pubDate>
		<dc:creator>Michelle Minkoff</dc:creator>
				<category><![CDATA[CAR]]></category>
		<category><![CDATA[sql]]></category>
		<category><![CDATA[theory]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[ace reporter]]></category>
		<category><![CDATA[achilles heel]]></category>
		<category><![CDATA[aspen institute]]></category>
		<category><![CDATA[believer]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[big picture]]></category>
		<category><![CDATA[cakewalk]]></category>
		<category><![CDATA[cause and effect]]></category>
		<category><![CDATA[colleague]]></category>
		<category><![CDATA[computer time]]></category>
		<category><![CDATA[computers]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[english term paper]]></category>
		<category><![CDATA[human brains]]></category>
		<category><![CDATA[knowledge gaps]]></category>
		<category><![CDATA[millisecond]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[page pdf]]></category>
		<category><![CDATA[peril]]></category>
		<category><![CDATA[statistical ethics]]></category>
		<category><![CDATA[story assignment]]></category>
		<category><![CDATA[travel time]]></category>

		<guid isPermaLink="false">http://michelleminkoff.com/?p=248</guid>
		<description><![CDATA[Picture it: You&#8217;ve been given a new story assignment, and you have to leave for the interview in five minutes. You&#8217;ve got to have enough background to ask the right questions, but there&#8217;s no time to do research.  Somehow pulling that all-nighter on an English term paper in college seems like a cakewalk. Luckily, you&#8217;re [...]]]></description>
			<content:encoded><![CDATA[<p>Picture it: You&#8217;ve been given a new story assignment, and you have to leave for the interview in five minutes.  You&#8217;ve got to have enough background to ask the right questions, but there&#8217;s no time to do research.  Somehow pulling that all-nighter on an English term paper in college seems like a cakewalk. Luckily, you&#8217;re a seasoned journalist (where I hope to be in 10 years), and you&#8217;re used to figuring it out fast.</p>
<p>Enter the new reporter.  He can come up with millions of questions in a millisecond, and assimilate information ten times faster than that.  He doesn&#8217;t take no for an answer, and he doesn&#8217;t even need to spend travel time on his interviews.  Oh, and he&#8217;s the pinnacle of efficiency &#8211; doesn&#8217;t take lunch or bathroom breaks, doesn&#8217;t even stop to sleep.  He&#8217;s the new ace reporter on staff: a computer.</p>
<p>Time to be afraid?  No, because the computer has a serious Achilles heel &#8212; it has no flexibility, no understanding of nuance. It&#8217;s either black or white, right or wrong.  The best way to work with your new colleague, so to speak, is to let him focus on his strengths, while you focus on yours.  I&#8217;m talking about using the machine as an analytical tool for details, but leaving the big picture exploration to the humans.  This concept is buzzing around the Internet these days &#8212; the idea of parsing &#8220;big data&#8221; &#8212; data sets that are too big for human brains alone to comprehend.<span id="more-248"></span></p>
<p>This idea was explored in a <a href="http://www.aspeninstitute.org/sites/default/files/content/docs/pubs/InfoTec h09.pdf" target="_blank">large-scale report</a> (66-page-PDF ahead!)  from the Aspen Institute, titled &#8220;The Promise and Peril of Big Data.&#8221;  I&#8217;m not surprised by a title like that, having long been a believer that all technology is amoral, and like much of life, brings its own good and bad aspects.  The computer can examine these large sets, but we have to use our own judgment to bring journalistic and statistical ethics &#8212; just because there is a correlation, that doesn&#8217;t mean there is a cause and effect, for example.  But just as we go to expert sources to fill in our own knowledge gaps, the computer serves as a certain type of expert that can aid reporting.</p>
<p>Here&#8217;s the computer&#8217;s specialty:  It  helps us analyze information, and then display it so it&#8217;s fit for human consumption. This is the essence of journalism, it&#8217;s just that the computer can process more stuff than the human brain can.  But we must make sense of it.  He&#8217;ll take care of the trees, we have to deal with the forest.</p>
<p>As for statistical analysis, it&#8217;s been big in the business community for years. The idea is simple: Parse the information so it can help people make intelligent decisions. The field is progressing rapidly, and the community is talking about using data for storytelling.  Roger Magoulas, director of research for O&#8217;Reilly Media, recently <a href="http://radar.oreilly.com/2010/01/roger-magoulas-on-big-data.html?utm_source=feedburner&amp;utm_medium=email&amp;utm_campaign=Feed%3A+oreilly%2Fradar%2Fatom+%28O%27Reilly+Radar%29">said</a>, &#8220;You need more sophisticated ways of distilling what you know down.  A good visualization, if you turn something into a good story, it&#8217;s just going to resonate a lot more&#8230;than a simple chart that gets delivered to everyone.&#8221;</p>
<p>Wait, isn&#8217;t that what journalists do?  Only difference is that instead of making data accessible to a few decision makers, it&#8217;s about making data accessible to a bigger group of people.  Many call it data analysis, I call it opening up democracy.</p>
<h3  class="related_post_title">Related posts you might enjoy:</h3><ul class="related_post"><li>January 30, 2010 -- <a href="http://michelleminkoff.com/2010/01/30/parallelism-packing-information-into-visualization/" title="Parallelism: Packing information into visualization">Parallelism: Packing information into visualization</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://michelleminkoff.com/2010/01/18/bigdata/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

