Key Takeaways and Raw Responses to Chicago School of Data Census Form

As part of our post- Chicago School of Data Days work, we are doing lots of analysis of the data we’ve collected and the artifacts we created together.

One key dataset, especially for the consideration of gaps in data provision and skills development, are the answers to the census form we’ve been working on for months. We’ve got 246 responses to date. Here they are, with identifying information and end-matter (re: how they want to be contacted, participation in the project, etc.) removed.

Untitled

Takeaways

The census form was a key part of the Chicago School of Data project and the conference.  We took responses from this survey to better understand how data was used by organizations and responded by creating themes that defined the conference sessions and discussions.

Here are the themes that we came to:

Continue reading

Chicago Region’s Civic Innovation Community at the Code for America Summit!

This week is a great week for civic innovation—the Code for America Summit in San Francisco is here. Smart Chicago will be there in force. I will be there, as well as consultants Christopher Whitaker and Josh Kalov. (Along with about a dozen other representatives from Chicago’s civic hacking community)

2013_1015_071337 CfASummit

We’ll be live tweeting the event on our @SmartChicago account, but you can also follow along using the #CfAsummit hashtag. Below the fold, we’ve all the details of the Chicago area delegation.

Continue reading

The Launch of the Chicago School of Data Project

Smart Chicago has started work on the Chicago School of Data Project, which has three main components:

  • Convene a core group of practitioners in Chicago who are using data to improve the lives of regular residents
  • Document and map the landscape of data activity in Chicago— the entities, tasks, companies, enterprises, civil service organizations, and others who make up the field
  • Plan a region-wide event in early autumn where we will share this mapping work with the larger data community. We seek to showcase all of the activity underway through capacity-building workshops and demonstrations

From this project, we hope to develop a collaborative framework and tools for improving connections across the Chicago data ecosystem– the Chicago School of Data.

Matt Gee,  a respected leader in the Chicago data community, has been hired to lead this project.  Here’s a look at the work ahead:

  • Convene small-group discussions with key partners to help us frame the work and make sure that we see the entire discipline
  • Lead larger convenings of 20 – 30 people from a wider group of stakeholders to understand needs, identify opportunities, and plan for events
  • Organize a city-wide data census with volunteer data ambassadors canvassing organizations to understand what’s happening now
  • Define the scope, breadth, time period, venue, and zeitgeist of the event itself, in concert with the stakeholders
  • Review existing documents, including grant agreements to practitioners, blog posts from the field, evaluations of existing market activity,  the Urban Institute assessment, entries from our city-wide data census, and documentation of conversations conducted throughout the project
  • Define the landscape of data work in Chicago and compile a cohesive narrative that gives shape, direction, and clarity to all included
  • Recruit speakers, teachers, and panelists for the event and work with them on their content

This is a lot of work. It will only be of value if it is inclusive and exhaustive. If you think what we’re saying speaks to you— if you have any inkling that you use data to improve lives in Chicago— we want to hear from you. Even if we’re already deep partners, and talk to each other every day, please complete this form.

If you are interested in helping out on the project itself, we need people to conduct interviews and help others complete the form to get their voices heard. If you’re interested in helping on this, please let us know here.

Excerpt, Beyond Transparency, Building a Smarter Chicago: Capital and Products

For the past few Tuesdays, we’ve excerpted sections from Beyond Transparency: Open Data and the Future of Civic Innovation“, an anthology edited by Brett Goldstein with Lauren Dyson and published by Code for America.

I wrote a chapter titled, “Building a Smarter Chicago“, which I call “an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere”. Here’s the last few sections, covering capital, products an the road ahead:

Capital: Philanthropy Leads, Capital Must Follow

Without money, there is no sustainability.

As an ecosystem matures, it finds ways to adapt and grow. In technology and data, growing means capital. In Chicago, a main source of capital currently comes from philanthropic sources, though there are some stirrings in the market.

The first open government data apps contest—Apps for Metro Chicago—was primarily funded by the MacArthur Foundation (O’Brien, 2011). The contest was an important moment in the ecosystem—it was the first time that government and developers were brought together in the context of a project with cash prizes.

The Smart Chicago Collaborative, a civic organization devoted to improving lives in Chicago through technology, is funded by the MacArthur Foundation and the Chicago Community Trust. Additional funding came through the federal government’s Broadband Technology Opportunities Program, a program designed to expand access and adoption of broadband opportunities in communities across America (National Telecommunications and Information Administration, n.d.).

EveryBlock was funded by a $1 million grant from the Knight Foundation, and then was acquired by MSNBC. This was a test of using philanthropic money and open source as a basis for a business. There have not been many examples since then. This is a problem that needs to be fixed—we need more experimentation, more value.

A digital startup hub in Chicago, known as 1871, has a number of civic startups in their space, including Smart Chicago, Tracklytics, Purple Binder, and Data Made. As these organizations deliver more value, the entire civic innovation sector will attract more capital.

Products: The Next Frontier

In order for the ecosystem to be self-sustaining, we have to create popular, scalable, and revenue-generating products with civic data.

Developers in Chicago are making a renewed focus on users. An example is the Civic User Testing Group run by Smart Chicago (Smart Chicago Collaborative, n.d.). We’ve spent years trying to get regular residents to participate in the product development process, and now we have more than five hundred people signed up in our first six months.

We have to do this—go beyond anecdote, beyond the cool app that lacks real traction, into creating business models and datasets that add value. We need to make products and services that people can’t live without.

This will require a mix of proprietary solutions, open source code, and shared standards. Companies need to follow viable product strategies—moving from one-off apps to sustainable systems. Interoperable data is a critical component to making this happen.

The good thing about this is that there are models to follow in other successful companies right here in Chicago. SitterCity is a vast consumer success story. OpenTable, Groupon, and GrubHub are all Chicago companies that found ways to reduce transaction friction in various markets.

They did this, in the main, with a strict attention to customers. In the civic innovation sector of the technology industry, we call those people “residents.” When you are serving people and make popular products, you are necessarily serving a civic need.

We’re beginning to focus on this work here in Chicago by adding value to civic data with unstructured public content, by creating systems around predictive analytics, and making baseline services, like Open311, that can serve future product needs.

What’s Your Ecosystem?

This is a short take on a complicated subject that, in the end, has to be completely local. Hopefully, it gives some specific examples of how we’ve built an open data ecosystem in Chicago and points to how far we have to go.

Chicago has contributed, in our small way, but we have to be measured by how we contribute to the entirety of the internet, rather than this civic innovation subset. We’re ready to keep going, and we’re excited to share our models with the rest of the country and the world.

Excerpt, Beyond Transparency, Building a Smarter Chicago: Civic Activism

For the past few Tuesdays, we’ve excerpting sections from Beyond Transparency: Open Data and the Future of Civic Innovation“, an anthology edited by Brett Goldstein with Lauren Dyson and published by Code for America.

I wrote a chapter titled, “Building a Smarter Chicago“, which I call “an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere”. Here’s the third section, which covers our long history of civic activism:

Developers: Civic Activism

Every city has its own history and its own approach to the world, and I think that is expressed in its technological history as well. Chicago has been a center of civic activism and individual public creativity for decades.

It can be traced as far back as Jane Addams, who created the Hull House in 1889. It was the first “settlement house,” cooperative residences for middle-class “settlers” in predominantly immigrant neighborhoods that aimed to reduce inequality in urban areas (Wade, 2004). She was also a tireless scholar who studied the geographical distribution of typhoid fever and found that it was the working poor who suffered most from the illness.

Chicago is the place where the drive for common standards, like the eight-hour workday, was fought (Jentz, n.d.). It was a center for the battle against mortgage redlining (the practice of denying or raising prices for mortgages that has played a role in the decay of cities). Activists used data to understand the predicament and prove their case.

The General Transit Feed Specification (GTFS) is a recent national example of success in putting civic data to use for the public good. Everyone loves CTA bus tracker apps, but few people know that the installation of the GPS satellite technology making that possible is the result of a lawsuit brought by a group associated with the Americans Disabled for Accessible Public Transit (Chicago Transit Authority, n.d.). Their case, Access Living et al. v. Chicago Transit Authority, required “installation of audio-visual equipment on buses to announce bus stop information to riders who have visual impairments or are deaf or hard of hearing” (Equip for Equality, n.d.). When you hear the loudspeaker system announce the next street where the bus is stopping, you have de facto data activists to thank.

This is the place where saxophonists rise from the stage, blare out a ten-minute solo, and calmly fade back into the band. It’s the place where slam poetry was conceived—individual poets audaciously grabbing the mic for three minutes and getting judged by the crowd. It’s also where improv comedy—with its focus on ensemble and fast thinking—was invented.

These are threads for us in the civic innovation movement here in Chicago. I believe they’re embedded in the work. They form examples for us to follow—the quiet humility of the worker in the crowd, the developer among the people.

You can find recitations of particular apps using specific datasets anywhere. Just remember that every city has unique cultural and technological histories. This is the essence of an ecosystem, and it’s why they are local.

It’s one thing to recognize history and another to build a local movement from it. Here are some of the entities that have helped form and accelerate the work:

  • Illinois Data Exchange Affiliates was an early-incarnation open data group that led the way (Illinois Data Exchange Affiliates, 2007).
  • Independent Government Observers Task Force was a 2008 non-conference, where many of the leaders of the movement worked together (Independent Government Observers Task Force, 2008).
  • Open Government Chicago(-land) is a meetup group started by Joe Germuska (Open Government Chicago(-land), 2013).
  • Open Gov Hack Nights are weekly meetings that have been critical to accelerating the pace of development (Open Gov Hack Night, n.d.).
  • Digital.CityofChicago.org is a publication at the center of city policy and examples (“Release All the Data,” 2013).

Excerpt, Beyond Transparency, Building a Smarter Chicago: Enlightened Self-Interest Meets the Movement

For the next few Tuesdays, we are excerpting sections from Beyond Transparency: Open Data and the Future of Civic Innovation“, an anthology edited by Brett Goldstein with Lauren Dyson and published by Code for America.

I wrote a chapter titled, “Building a Smarter Chicago“, which I call “an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere”. Here’s the second section, which covers the movement toward policy:

Policy: Enlightened Self-Interest Meets the Movement

As a co-founder of EveryBlock, I spent four years (2007 to 2011) working with sixteen municipalities on publishing data. I saw some fundamental patterns of open data policy development that held true here in Chicago.

First off, I can’t emphasize the power of examples enough. In December 2007, I was part of a meeting of open data advocates in Sebastopol, California. The mission was “to develop a more robust understanding of why open government data is essential to democracy.”

The output was the “8 Principles of Open Government Data” (Open Government Working Group, 2007). This simple document was a powerful, unimpeachable tool that I used every time I worked with government. It made a significant difference because it gave government-based open data advocates something to point to when they were in their internal meetings. This support of isolated pockets of policymakers was one important pattern I saw here in Chicago as well. Building relationships with public, sharable resources, like the “8 Principles,” allowed for shared trust and shared work. This pattern of template sharing is something that works.

There were nascent open data plans and products in the Daley administration, including Chicago Works For You, a project I worked on as a consultant for the City in 2005. Micah Sifry discussed this project in a 2009 article titled “A See-Through Society”:

People are eager for access to information, and public officials who try to stand in the way will discover that the internet responds to information suppression by routing around the problem. Consider the story of a site you’ve never seen, ChicagoWorksForYou.com. In June 2005, a team of Web developers working for the City of Chicago began developing a site that would take the fifty-five different kinds of service requests that flow into the city’s 311 database—items like pothole repairs, tree-trimming, garbage-can placement, building permits, and restaurant inspections—and enable users to search by address and “map what’s happening in your neighborhood.” The idea was to showcase city services at the local level. (Sifry, 2009)

Early failures often lead the way to the next policy win—that’s another pattern.

Hot topics that receive public attention are fecund areas for open data policy. In Chicago, Tax Increment Financing is a big topic, mainly because it has been an opaque financial instrument, handling huge amounts of money with very little public information about how the system works.

It’s no accident that a number of Aldermen sponsored the TIF Sunshine Ordinance in 2009 (Brooks & O’Neil, 2009). Pressure and heat get results.

The last pattern has perhaps led to the most good: when the chief executive of a unit of government wants to make a big push. Mayor Michael Bloomberg of New York won an unusual third term at the same time he pushed for BigApps; San Francisco Mayor Gavin Newsom was planning a run for governor at the same time he worked to open DataSF; and our own Mayor Rahm Emanuel embraced open data when he made a move from the White House to Chicago City Hall.

This is the pattern of powerful, enlightened elected officials in the executive branch deciding that open data is good policy. They back this up by empowering people, like former Chicago CIO Brett Goldstein and CTO John Tolva, to develop and implement that policy.

It’s the unique and aggressive policy of publishing data that has brought the movement further here in Chicago.

Excerpt: Beyond Transparency, Building a Smarter Chicago: An Era of Incidental Transparency

For the next few Tuesdays, we will be excerpting sections from Beyond Transparency: Open Data and the Future of Civic Innovation“, an anthology edited by Brett Goldstein with Lauren Dyson and published by Code for America.

I wrote a chapter titled, “Building a Smarter Chicago“, which I call “an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere”. Here’s the introduction and the first section, which gives a short history of the ecosystem:

Introduction

As the open data and open government movement continues, there is a lot of talk about building local ecosystems for the work. The general idea is that there has to be a mildly magic combination of data, policy, developers, capital, and products to enable the kind of growth that is necessary to take the movement to the next level—where there is a mature market for open government products that serve real community needs and lead to sustainable revenue.

The thing about building an ecosystem is that when it is done deliberately, it can be a slog. Building a developer community from scratch, convincing local government to publish data, getting venture capitalists to take a look at open government projects—all of this is tough work that takes time.

By looking at the Chicago example, however, we can see that there’s often more built than it first seems. The components can be found, in varying degrees, in any unit of government. The trick is to find, cobble, and congeal these pieces together.

What follows is an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere.

Data: An Era of Incidental Transparency

The story starts with Citizen ICAM (Information Collection for Automated Mapping), the granddaddy of all crime mapping applications, created by the Chicago Police Department in May 1995. I wrote about this system back in 2006 because I wanted to understand the archaeology of this distinctly unique (and relatively difficult to use) interface (O’Neil, 2006). You can learn a lot about software by its backstory. Here’s the first sentence of a July 1996 National Institute of Justice report on Citizen ICAM:

To better understand the nature and extent of criminal and social problems in the community and improve allocation of resources, a growing number of crime control and prevention organizations are turning to computerized mapping. (Rich, 1996)

The impetus behind the project (“Citizen” is the first word in its name) was the Chicago Alternative Policing Strategy (CAPS) program. Here’s another snip from the 1996 report:

ICAM was developed as part of CPD’s far-reaching and ambitious community policing strategy. Unlike many other community-policing programs that are limited to a single unit in the department, the Chicago Alternative Policing Strategy (CAPS) is department-wide. The strategic plan for reinventing CPD describes CAPS as a “wholesale transformation of the department, from a largely centralized, incident-driven, crime suppression agency to a more decentralized, customer-driven organization dedicated to solving problems, preventing crime, and improving the quality of life in each of Chicago’s neighborhoods.

In fact, CAPS is really a city program with strong support from the Mayor’s office and close involvement of city agencies, which have been directed to give top priority to “CAPS service requests” that affect crime and neighborhood safety. (Rich, 1996)

This twenty-year-old project is a model for where we need to be now—and where the movement seems to be heading. It starts with deep input from residents to form a “customer-driven organization.”

In the technology world, we call these people “users.”

Adrian Holovaty’s ChicagoCrime.org—widely considered a major impetus in the open data movement—simply would not have existed without Citizen ICAM (Holovaty, 2008). At the same time, ChicagoCrime.org was certainly not well-formed public data. For instance, all data was retrieved by scraping with obscure URL calls that ignored the user interface, which limited searches to a quarter-mile radius.

Another example is transit data “published” by the Chicago Transit Authority in the context of their proprietary Bus Tracker system. I covered this extensively in a January 2009 blog post (O’Neil, 2009). The upshot is that Harper Reed scraped all data driving the app, cached it, and served it to developers. This led to a blossoming of transit-focused apps.

The culmination of this work is the publication of the CTA’s own API, a document wherein Harper and I are explicitly called out for helping them develop it:

Special thanks go to Harper Reed and Dan O’Neil for their support and encouragement, and to the independent development community, for showing such great interest in developing applications with CTA data, leading to the creation of this official API. Thank you. (Chicago Transit Authority, 2011)

This is the kind of inside/outside game that is also essential to the ecosystem. You have to work with government institutions to make their data fluency and data policy better.

A last example of early data in Chicago (and perhaps the first explicitly conscious publication of data in the city) is the wealth of Geographic Information Systems (GIS) data published by the City of Chicago. This was another early reason why ChicagoCrime (and, by extension, EveryBlock) could exist. Their policy was formalized in July 2007, but the data had been available long before that (City of Chicago, 2007).

The first section of their documentation, “Data Sharing Principles,” has the idea that public information should be public: “Wherever possible, direct requestors to publicly available internet sources of map information.”

This is the moment when the governmental provision of data goes from incidental to essential. Before that magic moment, it’s important for developers and citizens to look harder for data published in plain sight.

Building a Smarter Chicago Chapter in Beyond Transparency, a New Book from Code for America

Today marks the publication of “Beyond Transparency: Open Data and the Future of Civic Innovation“, an anthology edited by Brett Goldstein with Lauren Dyson and published by Code for America. Here’s a blurb:

Beyond Transparency is a cross-disciplinary survey of the open data landscape, in which practitioners share their own stories of what they’ve accomplished with open civic data. It seeks to move beyond the rhetoric of transparency for transparency’s sake and towards action and problem solving. Through these stories, we examine what is needed to build an ecosystem in which open data can become the raw materials to drive more effective decision-making and efficient service delivery, spur economic activity, and empower citizens to take an active role in improving their own communities.

I wrote a chapter for this anthology titled, “Building a Smarter Chicago“, which I call “an illustrative, incomplete, and idiosyncratic look at the ecosystem in Chicago. It is meant to provide a thumbnail take on how the ecosystem developed here, while sparking fires elsewhere”.

I’m really honored to have taken part in this important publication. We’ll be serializing the chapter over the next few days. Meantime, to download or order a printed copy, visit beyondtransparency.org.

Smart Chicago Collaborative and the City’s Technology Plan

Earlier this month, Chicago Chief Technology Officer John Tolva unveiled the city’s very first technology plan. The plan was a result of a year-long process of research, brainstorming, and thinking about how to make all of Chicago competitive in the new digital economy.

This plan is a comprehensive framework for growing Chicago’s technology sector , getting broadband connectivity for everyone, and  ensuring that Chicago remains a leader in open government data .

The plan also highlights the work that the civic technology community has been doing in Chicago. From the weekly OpenGov Hack Nights, the Smart Communities Program, and youth STEM programs; Chicago already enjoys a strong set of technology strengths and this plan will enable the city to advance even further.

Smart Chicago Collaborative is proud to have a key role in many of these initiatives and is dedicated to  implementing this plan. Here’s a look at our role in the plan and the aspects of our existing work in this context.

Continue reading

Smart Chicago + Local Developers + Tweets = Better Public Health

The Smart Chicago Collaborative provided server space on our Amazon Web Service account so that a local developer to develop an algorithm to classify tweets in the service of public health. Here’s how Joe Olson of Tracklytics describes the work:

Tracklytics partnered with Smart Chicago and DePaul University to enter a the Now Trending: #Health in My Community contest sponsored by the Department of Health and Human Services. The purpose of the contest was to spur innovation in analyzing social media outlets to detect global health trends. Specifically, the contest requirements were to use Twitter to identify trending topics relating to disease, such as West Nile virus and Legionnaire’s disease.

Tracklytics designed and implemented the solution, DePaul helped with testing and algorithm validation, Smart Chicago funded the computer time.

We did not win the contest, but several innovations were developed. We now have the ability to collect over 1,000,000 disease-related tweets per day. A subset of these tweets are run through a machine learning algorithm, and can be filtered down to a much smaller set of tweets where someone is reporting having an instance of a disease, as well as their location. Current accuracy of this is near 85%. We’ve reduced the cost of classifying tweets down to .0000057 cents/tweet, storing them for analysis down to 0.000000235 cents/tweet/month, and archiving them down to 0.000000004 cents/tweet/month.

You can view the site at http://hhs.tracklytics.com

We are currently working on modifying this technology to track food poisoning cases in Chicago. Once legitimate food poisoning cases reported by individuals are identified by Twitter using the machine learning algorithm, we can send to the person reporting the case a tweet containing a URL directing them to a form to collect more specific information (do you suspect the incident occurred at a restaurant, was from tainted food, etc). This information can them be entered into the City’s 311 system via the Open311 interface, and directed to the City’s Health Department for further analysis.

Lots more to come from this work— stay tuned!