Overview

This reference page for Measuring Broadband America provides the raw bulk data set, containing data for all tests conducted for this study for the period from February 2011 through June 2011. This data set is distinguished from the validated data set of March 2011 on which the report was based in the following ways:

  • The March 2011 data set was validated to remove statistically low sample counts and other anomalies. (See the report Technical Appendix and other material posted on the website describing the validation process.) The raw bulk data set includes all data that was acquired during testing.
  • The raw bulk data set can include ISPs or regions that may not have been included in the report due to low sample counts.

The data set covers all tests run from February 2011 through June 2011 and has been split into individual monthly files for easier downloading.

Files

Data Dictionary

Description

This document provides a brief explanation of each field included in the data set.

Why We Provide It

This file is provided as documentation on the structure of the data. This one file describes all of the data files on this page.

Download

raw-bulk-data-2011-dict.xls

Data Cleansing

Description

A set of files that, when run against the raw March data set, produces a validated set of data equivalent to that used for the FCC report.

Why We Provide It

These files are provided for full transparency and to help researchers examine the data. Each file corresponds to a specific performance metric and contains SQL statements. Together they describe how the source data was manipulated to produce the validated data set for March 2011, the reporting period of the FCC Measuring Broadband America report. As noted in the documentation, certain data elements are removed from the data set when anomalies are noted (for example, when a white box stops reporting for a significant period of time). The resulting processed data set, termed a validated data set, is used as the basis for the report.

Download

raw_data_cleansing_scripts_2011.tar.gz

Unit Profile

Description

This document identifies the details of each test unit, including ISP, technology, service tier, and general location. Each unit represents one volunteer panelist. The unit IDs were random, which served to protect the anonymity of the volunteer panelists. Technical note: This is a large, normalized data set which expands to multiple files. Most users will need to import this data into a relational database for viewing.

Why We Provide It

This data is provided as reference for researchers seeking to look up information for the individual units running tests.

Download

unit-profile-2011.zip (111 KB download, 548 KB expanded, checksums)
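
Because checksums are published for this download, a downloaded file can be checked before use. The following is a minimal sketch only: the page does not say which hash algorithm the checksums use, so the `algorithm` argument (defaulting to SHA-256 here) is an assumption and should be set to whatever the published checksum list specifies. The function name `file_digest` is illustrative.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha256", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, reading it in chunks.

    The default algorithm is an assumption; use the one named in the
    published checksum list (for example "md5" or "sha1").
    """
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Read fixed-size chunks so multi-gigabyte archives are never
        # loaded into memory all at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()
```

Comparing the returned string with the published value (ignoring letter case) indicates whether the download is intact. The same function can be applied to the larger monthly archives.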

Unit Census Block

Description

This spreadsheet identifies the census block in which each unit running tests is located. Census blocks are from the 2000 census and are given in the FIPS block code format. No block contains fewer than 35 people.
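
A FIPS block code is a 15-digit string made up of a 2-digit state code, a 3-digit county code, a 6-digit tract code, and a 4-digit block code whose first digit is the block group. The sketch below splits such a code into its parts. Since the download's file name suggests the values may be block groups rather than full blocks, it also accepts the 12-digit block-group form; that is an assumption about the file's contents, and the names `CensusBlock` and `parse_fips_block` are illustrative. Codes should be handled as text, since spreadsheet tools can drop leading zeros from numeric cells.

```python
from typing import NamedTuple


class CensusBlock(NamedTuple):
    state: str        # 2-digit state FIPS code
    county: str       # 3-digit county code
    tract: str        # 6-digit census tract code
    block_group: str  # 1 digit; also the first digit of the block code
    block: str        # 4-digit block code, or "" for a 12-digit input


def parse_fips_block(code: str) -> CensusBlock:
    """Split a 15-digit block code (or 12-digit block-group code) into parts."""
    code = code.strip()
    if not (code.isascii() and code.isdigit()) or len(code) not in (12, 15):
        raise ValueError(f"expected 12 or 15 digits, got {code!r}")
    return CensusBlock(
        state=code[0:2],
        county=code[2:5],
        tract=code[5:11],
        block_group=code[11],
        block=code[11:15] if len(code) == 15 else "",
    )
```

For example, `parse_fips_block("010010201001000")` yields state `01`, county `001`, tract `020100`, block group `1`, and block `1000`. The input value here is only an illustration of the format, not a value taken from the data set.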

Why We Provide It

This data is provided to identify the general geographic location of the unit running the test.

Download

Unit_ID_Census_Block_Group.xls (548 KB download, 548 KB expanded)

Raw Bulk Data

Description

This document contains the raw bulk data from which the validated March 2011 data set used in the Report was taken. It includes the detailed results for all 13 tests that were run as part of this study, in addition to the results of a separate quality metric run to assess which reporting units were available. The raw bulk data set was collected as discussed in the Report and Technical Appendix; further details can be found in the Methodology section of this page. Technical note: This is a large, normalized data set which expands to multiple files. Most users will need to import this data into a relational database for viewing.
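
As the technical note says, the data is easiest to work with once it has been imported into a relational database. The following is a minimal sketch of one way to do that with SQLite. It assumes each archive contains UTF-8 CSV files with a header row, which this page does not confirm; check the data dictionary for the actual file layout. The function names `load_archive` and `_safe_name` are illustrative.

```python
import csv
import io
import re
import sqlite3
import tarfile
from contextlib import closing
from typing import Dict


def _safe_name(name: str) -> str:
    """Reduce an arbitrary string to word characters for use as an identifier."""
    return re.sub(r"\W", "_", name.strip()) or "unnamed"


def load_archive(archive_path: str, db_path: str) -> Dict[str, int]:
    """Load every .csv member of a .tar.gz archive into its own SQLite table.

    Assumes (not confirmed by the source page) that members are UTF-8 CSV
    files with a header row. Returns a mapping of table name to rows inserted.
    """
    counts: Dict[str, int] = {}
    with tarfile.open(archive_path, "r:gz") as tar, \
            closing(sqlite3.connect(db_path)) as db:
        for member in tar:
            if not (member.isfile() and member.name.lower().endswith(".csv")):
                continue
            raw = tar.extractfile(member)
            if raw is None:
                continue
            reader = csv.reader(io.TextIOWrapper(raw, encoding="utf-8", newline=""))
            header = next(reader, None)
            if not header:
                continue
            # Table name comes from the file name without directory or ".csv".
            table = _safe_name(member.name.rsplit("/", 1)[-1][:-4])
            cols = [_safe_name(h) for h in header]
            col_sql = ", ".join(f'"{c}"' for c in cols)
            db.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({col_sql})')
            marks = ", ".join("?" for _ in cols)
            # Skip malformed rows whose field count differs from the header.
            rows = (r for r in reader if len(r) == len(cols))
            cur = db.executemany(f'INSERT INTO "{table}" VALUES ({marks})', rows)
            counts[table] = counts.get(table, 0) + cur.rowcount
        db.commit()
    return counts
```

Calling `load_archive` once per monthly archive with the same database path appends each month's rows to the same tables, so a query can span several months. All columns are stored without declared types; casting to numeric types is left to the queries.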

Why We Provide It

The raw data is provided for full transparency and to enable researchers to conduct their own analysis from a common pool of data.

Download

raw-bulk-data-feb-2011.tar.gz    (1.9 GB)
raw-bulk-data-mar-2011.tar.gz    (3.9 GB)
raw-bulk-data-apr-2011.tar.gz    (4.0 GB)
raw-bulk-data-may-2011.tar.gz    (4.2 GB)
raw-bulk-data-jun-2011.tar.gz    (4.0 GB)
raw-bulk-data-jul-2011.tar.gz    (3.2 GB)
raw-bulk-data-aug-2011.tar.gz    (3.1 GB)
raw-bulk-data-sep-2011.tar.gz    (3.3 GB)
raw-bulk-data-oct-2011.tar.gz    (4.8 GB)
raw-bulk-data-nov-2011.tar.gz    (4.6 GB)
raw-bulk-data-dec-2011.tar.gz    (4.7 GB)
