OpenSky has been founded by scientists for scientists. The mission of our non-profit association since 2014 has been to support open global air traffic research by universities and other not-for-profit institutions. We can confidently state that this has been an increasingly successful endeavour. In this blog post, I want to briefly look back on recent developments, illustrating the growing influence of OpenSky and air traffic data in general on research of all sorts and our new efforts (helped by our impressive community) in supporting more of these attempts with even better and more tools and data. But everything is only possible thanks to our feeders and supporters around the globe! If you want to improve air traffic research around the world for the common good, do join us!
This analysis is just a fraction of the data we have analysed in a recent resarch paper . The figure illustrates the growing number of publications over the past few years based on OpenSky data and divides it also into the most popular research areas over time.
From the paper:
The gray bars illustrate the overall growth trend, which strongly accelerated by 2018 with 22 publications, 144% more than in the previous year. This trend continued in 2019 with a 118% increase. For this year (as of November 2020), 54 publications have been counted, ensuring a new record before the year has ended.
We can break down our analysis further by taking a look at the different scientific subfields in which OpenSky’s data have been used. We identified four large recurring themes: machine learning (ML, 16 papers overall), air traffic management (ATM, 50 papers), (cyber) security (36 papers),and Earth sciences (8 papers). Smaller areas, subsumed under ‘Others’, include research on noise, high-performance computing, sensor networks, and visualization, among others. From the trends ofthe subfields, we can see that security was the largest area of OpenSky data use in the early years up to and including 2018, reflecting the interests of the original community.
You can find every publication using OpenSky data on our Publications page.
Our target groups are researchers from two different spheres: academics and non-profit organsiation. The figure below provides an illustration of 100 academic research groups, which have received access to our Impala shell over the course of 2019.
We can see that these are groups from all over the world!
Besides academics, we support and help a lot of non-profit organisations , NGOs and journalists, which have been using our flight data for everything from internal research to publicly released analyses. Example use cases range from gauging Covid-19-related impact to investigative journalism. The full updated list of these can be found on our Social Impact page but they include for example Bellingcat, the United Nations or the Bank of England!
As also discussed in the paper , our existing access methods -- the live API and the Impala database -- have served us and many members of the scientific community very well. However, we have also realised that we can do better and provide both tools and datasets tailored more to specific settings and use cases to make it easier for non-experts to read and use our data.
To this end, we have created two new subpages, which collect and curate a) data tools provided by our community and b) datasets based on OpenSky data.
Our data tools page currently introduces five different, community-maintained tools, which can be used to access, process and visualize data from our Impala shell in different programming languages. These tools are already helping many researchers with easier access to our massive amounts of data stored. Go check them out!
Likewise, another avenue we have spent more time on in 2020, was to prepare different types of datasets based on OpenSky's whole data. These datasets fulfill different purposes, including better accessibility also without a working knowledge in programming/shell scripting and improved scientific comparability. Other datasets are the artifacts of scientific papers and have been released by their authors. We have collected all these efforts on our scientific dataset page, which currently lists eight different datasets!
 Martin Strohmeier. Research Usage and Social Impact of Crowdsourced Air Traffic Data.
In Proceedings of the 8th OpenSky Symposium 2020. November 2020.
 OpenSky Data Tools: https://opensky-network.org/data/data-tools
 OpenSky Scientific Datasets: https://opensky-network.org/data/datasets
 OpenSky Social Impact: https://opensky-network.org/about/social-impact