Hunter Owens



Open Source, Closed Data

17 Dec 2014

One problem organizations have faced in the past when opening up their data is that, well, the data has to remain closed. For example, data with either HIPAA or FERPA provisions is pretty much stuck in a closed state. Furthermore, many organizations are reluctant to open up certain datasets. However, all hope is not lost, rather, this article will advance the idea of Open Source, Closed Data and present several implementation strategies.

The Reality

As much as voices in Civc Tech believe in the imperative of opening up datasets, there are certain situations in which it just cannot happen. However, this should not prevent an organization from being able to take advantage of the benefits of open sourcing their code.

Partners with the Data Science for Social Good fellowship know that we release all of our code under an open source license, but alas, not all are willing to let their data or information out into the public. However, as a civic tech community, we need to encourage non-profits, governments, and others to open up their code, even if they cannot or will not open up their data.

Why?

Organizations benefit from putting open source code out there- the benefits far outweigh the cost. First, Open source code is a key metric that good talent will be looking for- hiring becomes a whole lot easier when you can just say ‘here, take a look at what we’ve done’. Furthermore, participating in open source allow you and similar organizations to share efforts. For example, the NPR Apps Template and the Chicago Tribune’s Tarbell Project have become standards in journalism, used outside of their organization and receiving improvements because of it.

Open sourcing acts as a signal that your organization is up to date with certain practices and norms- furthermore, this can be a way of modernizing and keeping up to date with best practices. However, many organization are wary of sharing code - this should not be as much of a concern as a it often is, alas. Working in the open often improves the quality of the work and the code, as bad practices such as putting secret keys into source control become untenable. Large organizations such as Facebook, Google and smaller organizations release open source code, and civic tech groups should also.

Sharing concerns with code often come down to is somebody going to be able to duplicate the work. However, the advantages of open source outweigh the costs. In fact, one may be able to drive industry specific standards by participating in open source. Another concern tends to be embarrassment over programming standards, although, this fear is almost certainly unfounded. Open Tech Strategies is great at helping build support for open tech projects, and I would encouraging contacting them if you need help inside your organization.

Strategies

Thoughts/Comments/Concerns

Email the author or tweet @hunter_owens.