Are the aircraft database dumps working?

2 years 7 months ago #1626 by fquad
Hi, I noticed that the aircraft data here is perhaps being cut short, as all months have exactly 460,000 entries. All monthly files also have their "date modified" field updated every day, so perhaps they are also (or only?) being constantly overwritten. Looking at an archived snapshot from July 2018, the file had 465583 entries then. I just hope the history of the database isn't lost.

Please Log in or Create an account to join the conversation.

2 years 7 months ago #1632 by strohmeier
Honestly, I never looked at the files in detail. They are definitely not the same but if you are looking into it more deeply, let me know.

Please Log in or Create an account to join the conversation.

2 years 7 months ago - 2 years 7 months ago #1634 by fquad
It sure looks like the files are being cropped at 460,000 entries, every snapshot has this exact number. They are indeed not same, as the plot below demonstrates, so the overwrite theory should not be true.

 

Comparing the '2020-12' snapshot with the '2021-09' one, 20,413 icao24 codes are present only in the older snapshot. I looked at the 16 type B738 aircraft among those (listed below), and they all appear to still be in OpenSky database accessed through the web (opensky-network.org/aircraft-database), even though they are not present in the current database dump. Some of those are currently flying, so their omission really looks like a bug.

'a05dbc', '424905', 'ab62f8', '4aca54', '4cab9e', '71c232', '750202', '0c20b6', '0c205b', 'a356f6', '4ca891', 'a47d05', '781047', 'e07583', '78044b', 'a9d712'
Attachments:

Please Log in or Create an account to join the conversation.

2 years 6 months ago #1640 by jluebbe
I experienced something similar. I added about 50 aircraft but most of them did not appear in the dump even a few days later. However the data is not lost. It stays available in the online version of the database.

Please Log in or Create an account to join the conversation.

2 years 4 months ago #1697 by strohmeier
So, looking into this, a (random) part of the database doesn't get written, likely running out of memory.

Definitely some bad news in that we won't be able to restore that historically. :/ The actual database itself was never affected, including the access via the website interface.

Hope we can fix this soon. For the time being, here's a complete manual dump of the database from today, 520000 lines instead of the capped 460000.

www.dropbox.com/s/ka61swsxtmrzfd8/aircra...atabase.csv.zip?dl=0

Please Log in or Create an account to join the conversation.

2 years 2 months ago - 1 year 11 months ago #1757 by strohmeier
Edit: seperate files now in the metadata folder.

opensky-network.org/datasets/metadata/

Please Log in or Create an account to join the conversation.

2 years 5 days ago #1881 by aleb
Where is the broken script performing the dump?

Please Log in or Create an account to join the conversation.

2 years 5 days ago - 2 years 5 days ago #1882 by aleb
The manual dumps are missing some columns, for example:

$ grep 01002c aircraft-database-2022-04.csv 
"01002c","SU-GBT","AIRBUS","Airbus","A321 231","A321","680","","L2J","Egyptair","EGYPTAIR","MSR","MS","Egypt Air","No ADS-B Emitter Category Information"
$ grep 01002c ~/Downloads/aircraftDatabase.csv 
"01002c","SU-GBT","AIRBUS","Airbus","A321 231","A321","680","","L2J","Egyptair","EGYPTAIR","MSR","MS","Egypt Air","D-AVZB","1997-05-30","","","1997-01-01","","","","false","false","false","","No ADS-B Emitter Category Information"
 

Please Log in or Create an account to join the conversation.

1 year 2 months ago #2340 by srcejon
It seems this is now working again (file has grown from 80MB to 130MB), however, I'm now seeing duplicate entries with different information, for the same icao24, but just differing in case. E.g:

"aa6b14","N770X","LEARJET","Learjet Inc","60","LJ60","60-336","","L2J","","","","","Mds Aviation Llc","","","2029-07-31","","2008-01-01","","","P&W CANADA PW305A","false","false","false","",""
...
"AA6B14","N770X","LEARJET","","","LJ60","","","L2J","","","","","","","","","","","","","","false","false","false","",""

In the old dumps, there didn't appear to be any icao24s in upper case.

Should these be merged in the db with case being enforced for new entries?

Please Log in or Create an account to join the conversation.

Powered by Kunena Forum
This website uses cookies to offer you the best experience of our services. By using this website you agree to our privacy policy!