Data help- Population by age group by US County

Dec 16, 2021

Hey all-

If anyone knows of a source for Population by US County broken down by age, I would appreciate it. Alternatively, it looks like the data is here , but as far as I can tell it’s a file for each state, which I’d rather not deal with…. I know, I know I should skill up on Python. 😁

9 Comments

ianbot

Dec 16, 2021Edited

It looks like the data you're after is already here: https://www2.census.gov/programs-surveys/popest/datasets/2010-2020/counties/asrh/CC-EST2020-ALLDATA6.csv (135MB+) with the secret decoder ring here: https://www2.census.gov/programs-surveys/popest/technical-documentation/file-layouts/2010-2020/cc-est2020-alldata6.pdf

Expand full comment

Reply (1)

T Coddington

Dec 16, 2021

ianbot wins the coveted "Reader of the Month" award! Your prize is a free subscription to I Numero 🤣😉... but seriously, thank you!

Expand full comment

David Watson

Dec 17, 2021

Age isn't a relevant criterion. It correlates because lots of old people have accumulated conditions caused by decades of poor choices. Many don't. These underlying conditions are what produces the immune deficiencies that contribute to serious outcomes to infections. This virus is easy prey for healthy immune systems. The real scandal is healthy immune systems are easy to maintain, but not profitable for the medical industrial complex. They have a name for obese diabetics who sit indoors watching sitcoms so have low vitamin D -- customers.

They're updating the census this year, apparently. Harassed me for months. Who does that -- commerce? They have the data you want, but not the data you need. I don't know if they'll give it to you.

Expand full comment

FHL Badenhorst

Dec 16, 2021

Perhaps IPUMS has the data in a format that you might find more usable. See: https://www.ipums.org/ - I think that IPUMS NHGIS looks promising, based on your comments; https://www.nhgis.org/

Expand full comment

Steve

Dec 16, 2021

That's where I was gonna say to go. County level data is a mess in the CDC databases that I've seen. Was working with some of the datasets months ago and settled on state because cleaning the county data would take way longer than I had.

Expand full comment

Reply (1)

Steve Gerard

Dec 16, 2021

So this is no good ? https://wonder.cdc.gov/bridged-race-population.html

Expand full comment

Reply (1)

Steve

Dec 16, 2021

I haven't looked at that one, I'd say download the csv and give it a check. I'm not the best coder either so for my purposes at the time it was going to be faster for what I was doing to do state level data. I was looking at a few different variables some of which were absent from some of the county databases.

Expand full comment

Reply (1)

Steve Gerard

Dec 17, 2021

During the vetting phase I tend to load any nominated dataset into a common area for review which is modelled so that each field value lands in a new row. In that manner there is no need to design a dedicated receiving space - all simply land in a generic structure of run identity, field number, and field value broken down by numeric, character, date or json/array type. This does not stop there being dedicated receiving areas one per candidate file type but does allow separation out before any decisions about the quality of contents. Eventually I expect to make the landing area sparse so that repeated values take less space after the first example of the value holds a slot of instanciation. Eventually also expect to dynamically create targeted receiving tables on the basis of the formats revealed by inspection of emerging field values. A kind of scanner for data which can readily ingest new sources without prior design work. However it is not enough to load data sources, we need to be able to annotate the records with a kind of water mark which evokes data quality measures and presents lineage to answer future questions.

Expand full comment

Reply (1)

Steve

Dec 17, 2021

This is super helpful, thanks!

Expand full comment