Data Processing

Data and Processing Information

About the Dataset

I am using data from the United States Census Bureau. To explore which exact datasets were used, please click one of the following:

Housing Dataset - This dataset contains information on household values and monthly rent. The dataset had 143 rows as of May 15th, 2019. The geographic locations I am using are all states in the United States (along with the District of Columbia). The exact rows I am using are Value and Gross Rent along with their median values. I am only using data from the year 2017 in order to display the most recent data. This dataset is used for the Housing visualizations.

Housing and Income Dataset - This dataset contains information on family household and non-family household income. The dataset had 40 rows as of May 15th, 2019. The two geographic locations I am using for comparison are the United States and Hawaii. The exact columns/rows I am using are family income by family size and nonfamily households. I am only using data from the year 2017 in order to display the most recent data.

Comparative Economics - This dataset contains information on occupations, industries, and class of workers of civilian employed population 16 years and over. The dataset had 137 rows as of May 15th, 2019. The geographic locations I am using for comparison are the United States, California, Hawaii, Maryland, New York, and Washington. The exact columns/rows I am using are Occupation, Industry, and Class of Worker from 2013-2017 estimates for the year 2017. The data I am using only includes civilian employed population 16 years and over within each respective geographic area.

Note - I would first like to note that many of the values in each dataset does not always add up exactly to 100% for population characteristics. All estimates in these datasets are ESTIMATES based upon the American Community Survery and the results from that survey. The survey is sent out every year to millions of households across the United States but the United States Census Bureau only makes estimates based upon the results they receive. This data does not factually represent population characteristics to the exact number. If you would like to discover how they perform the estimates, click one of the links above or visit the The United States Census Bureau's website. Secondly, I would like to note that sometimes the links that refer directly to the dataset I am using do not instanly take you to the dataset. One solution might be to click on this link here then click on one of the data set links. The second solution is to go to factfinder.census.gov, click on "advanced search", then click on "SHOW ME ALL", then click on one of the data set links.




Liscense

As stated on usa.gov:

United States government creative works, including writing, images, and computer code, are usually prepared by officers or employees of the United States government as part of their official duties. A government work is generally not subject to copyright in the United States and there is generally no copyright restriction on reproduction, derivative works, distribution, performance, or display of a government work. Unless the work falls under an exception, anyone may, without restriction under U.S. copyright laws:

  • Reproduce the work in print or digital form
  • Create derivative works
  • Perform the work publicly
  • Display the work
  • Distribute copies or digitally transfer the work to the public by sale or other transfer of ownership, or by rental, lease, or lending

  • For more information, please refer to usa.gov.




    Credit

    A huge thank you to the following for data, inspiration, and code:

    Data: United States Census Bureau.
    Template: This template was used from the University of San Francisco's CS 360/560 Data Visualization class.
    Code/Examples: Credit to Harry Stevens for inspiration and code for the d3 visualization on the career page. Many of the visualizations also came from Tableau.