Convert Figma logo to code with AI

BuzzFeedNews logoeverything

An index of all our open-source data, analysis, libraries, tools, and guides.

1,302
131
1,302
2

Top Related Projects

16,760

Data and code behind the articles and graphics at FiveThirtyEight

The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.

A repository of data on coronavirus cases and deaths in the U.S.

Quick Overview

The BuzzFeedNews/everything repository is a comprehensive collection of data, methodologies, and analyses used in BuzzFeed News stories. It serves as a transparent resource for journalists, researchers, and the public to access and verify the data behind BuzzFeed's investigative reporting and data-driven stories.

Pros

  • Promotes transparency in journalism by sharing raw data and methodologies
  • Provides valuable datasets for researchers and data enthusiasts
  • Encourages reproducibility of analyses and findings
  • Serves as an educational resource for aspiring data journalists

Cons

  • May require technical knowledge to fully utilize some datasets
  • Not all BuzzFeed News stories have corresponding data in the repository
  • Some datasets may become outdated over time
  • Lack of standardized format across different projects

Getting Started

As this is not a code library but a collection of data and analyses, there's no specific code to get started. However, you can follow these steps to explore the repository:

  1. Visit the GitHub repository: https://github.com/BuzzFeedNews/everything
  2. Browse through the folders to find specific projects or datasets
  3. Read the README files in each project folder for context and instructions
  4. Download or clone the repository to access the data locally
  5. Use your preferred data analysis tools (e.g., R, Python, Excel) to explore the datasets

Note: Some projects may have specific requirements or dependencies, so be sure to check the individual project documentation for detailed instructions.

Competitor Comparisons

16,760

Data and code behind the articles and graphics at FiveThirtyEight

Pros of data

  • More frequently updated with new datasets
  • Better organized directory structure
  • Includes detailed README files for each dataset

Cons of data

  • Narrower focus on political and sports data
  • Less diverse range of topics covered
  • Fewer total datasets available

Code comparison

data:

import pandas as pd

df = pd.read_csv('datasets/nfl-elo/nfl_elo.csv')
print(df.head())

everything:

import json

with open('data/2016-10-presidential-campaign-donors/data.json') as f:
    data = json.load(f)
print(data[:5])

Summary

Both repositories provide valuable datasets for analysis, but they cater to different needs. data offers a more structured and frequently updated collection, focusing on political and sports data with detailed documentation. everything covers a broader range of topics but with less organization and fewer updates. The code examples demonstrate different data formats used in each repository, with data primarily using CSV files and everything using JSON.

The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.

Pros of data-police-shootings

  • Focused dataset on a specific topic (police shootings)
  • Regularly updated with new incidents
  • Clear documentation and methodology

Cons of data-police-shootings

  • Limited scope compared to the broader range of topics in everything
  • Less diverse data formats and analysis tools
  • Fewer supplementary resources and explanatory materials

Code comparison

data-police-shootings:

import pandas as pd

df = pd.read_csv('fatal-police-shootings-data.csv')
print(df.head())

everything:

import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv('data/some_dataset.csv')
df.plot(x='date', y='value')
plt.show()

The data-police-shootings repository provides a straightforward CSV file for analysis, while everything offers a wider range of datasets and potentially more complex analysis tools. The code examples reflect this difference, with data-police-shootings focusing on basic data loading and everything showcasing more advanced visualization capabilities.

A repository of data on coronavirus cases and deaths in the U.S.

Pros of covid-19-data

  • Focused and specific dataset, making it easier to navigate and use for COVID-19 related analysis
  • Regularly updated with current data, ensuring relevance for ongoing research
  • Well-documented data sources and methodologies

Cons of covid-19-data

  • Limited scope, only covering COVID-19 data
  • Less diverse in terms of data types and topics covered
  • May require additional datasets for comprehensive analysis

Code Comparison

covid-19-data:

import pandas as pd

df = pd.read_csv('us-states.csv')
df['date'] = pd.to_datetime(df['date'])
df = df.sort_values(['state', 'date'])

everything:

import pandas as pd

df = pd.read_csv('data/some_dataset.csv')
df['date'] = pd.to_datetime(df['date'])
df = df.groupby('category').agg({'value': 'sum'})

Both repositories use pandas for data manipulation, but covid-19-data focuses on COVID-specific data processing, while everything demonstrates more general-purpose data analysis techniques across various datasets.

Convert Figma logo designs to code with AI

Visual Copilot

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

BuzzFeedNews/everything

An index of all our open-source data, analysis, libraries, tools, and guides.

Table of Contents


Data and Analyses

DateDescriptionRepo(s)Article(s)
2022-04-27Data and analysis of state child abuse and neglect registries and appeals:link::link:
2022-04-25Data and analysis of intermediate care facilities:link::link:
2021-09-17Data and analysis re. US adult guardianship filing counts:link::link:
2021-05-26Analysis of excess deaths caused by the February 2021 winter storm and power outages in Texas:link::link:
2020-11-11Analysis of county-level COVID-19 deaths and presidential voter preference:link::link:
2020-10-28Analysis of 2020's "Electoral College effect" by demographic:link::link:
2020-06-04Analysis of "1033" program transfers since Ferguson:link::link:
2020-05-07Analysis of ZIP code–level COVID-19 cases in five major cities:link::link:
2020-02-27Analysis of Census tract–level gentrification in five major cities:link::link:
2019-11-11Analysis of U.S. Census Survey of Income and Program Participation (SIPP), re. generational trends in support providers:link::link:
2019-10-31Analysis for "Your Dumb Tweets Are Getting Flagged To People Trying To Stop School Shootings":link::link:
2019-10-17Analysis for "Donald Trump’s Campaign Is Cashing In On Impeachment":link::link:
2019-10-03Analysis of FCC comments and data breaches:link::link:
2019-08-03Analysis of ActBlue's 2019 mid-year FEC report:link::link:
2019-07-17Analysis of contributions to presidential campaigns, based on 2019 Q2 filings:link::link:
2019-04-22Data and code to make maps and animation depicting current realities of climate change:link::link:
2019-04-16Analysis of donors giving $200+ to multiple Democratic candidates early in 2020 election cycle:link::link:
2019-01-24Data and code for, "Shoot Someone In A Major US City, And Odds Are You’ll Get Away With It":link::link: :link:
2018-12-28Year-end analysis of fake news sites and viral posts, 2016–2018:link::link:
2018-12-19Analysis of WeChat posts re. VP Pence:link::link:
2018-10-25Analysis and graphics for "How Russia’s Online Trolls Engaged Unsuspecting American Voters — And Sometimes Duped The Media":link::link:
2018-10-18Analysis of 2018 midterm election demographics:link::link:
2018-09-29Analysis of 'immigration services'-related FTC complaints:link::link:
2018-08-10Data, analysis, and graphics for "Russian Trolls Swarmed The Charlottesville March — Then Twitter Cracked Down":link::link:
2018-07-28Analysis of wildfire trends (with graphics):link::link:
2018-07-26Analysis of children's home inspection data from the UK's Office for Standards in Education, Children's Services and Skills ("Ofsted"):link::link::link:
2018-06-29Analysis of NYC 311 complaints and gentrification:link::link:
2018-05-01Analysis of fentanyl and cocaine overdose deaths:link::link:
2018-03-02Analysis of diversity in the dialogue of Best Picture–nominated films:link::link:
2018-02-23Analysis of Olympic figure skating scores:link::link:
2018-02-08Data and analysis for "The Edge" (re. figure skating):link::link:
2018-01-31Analysis of the text of every State of the Union address:link::link:
2018-01-24Data and analysis for "An Inside Look At The Accounts Twitter Has Censored In Countries Around The World":link::link:
2018-01-23Data and analysis for "How Trump’s Tweets Shaped A Year In Politics":link::link:
2017-12-28Data and analysis for "These Are 50 Of The Biggest Fake News Hits On Facebook In 2017":link::link:
2017-12-10Data, analysis, and charts for "What Sexual Misconduct Allegations Are Getting The Most Attention On Cable News?":link::link:
2017-12-05Data and analysis for "We Got Government Data On 20 Years Of Workplace Sexual Harassment Claims. These Charts Break It Down.":link::link:
2017-11-15Data on, and analysis of, federal employee diversity:link::link:
2017-11-03Data and code for "Under Trump, Gun Sales Did Not Spike After The Las Vegas Shooting":link::link:
2017-09-19Updated analysis of Harvey-related industrial emissions in Texas:link::link:
2017-09-11Federal employee departure rates, for "Trump’s Election Didn’t Spark An Immediate Exodus From The Federal Government":link::link:
2017-09-02FOIA logs referenced in "These Scientists Got To See Their Competitors’ Research Through Public Records Requests.":link::link:
2017-08-31Data and analysis on Harvey-related industrial emissions in Texas:link::link:
2017-08-08Data and analysis for "Inside The Partisan Fight For Your News Feed":link::link:
2017-08-07Data and analysis for "BuzzFeed News Trained A Computer To Search For Hidden Spy Planes. This Is What We Found.":link::link:
2017-07-25Data and analysis for "If Jeff Sessions Exits, Trump Could Choose An Acting Attorney General From Among Thousands Of People":link::link:
2017-05-24R code to recreate the graphics in "Why Americans Are So Damn Unhealthy, In 4 Shocking Charts":link::link:
2017-04-04Data and analysis supporting portions of "Fake News, Real Ads":link::link:
2017-01-31R code to recreate the graphics in "These Nobel Prizewinners Show Why Immigration Is So Important For American Science":link::link:
2017-01-19Data and analysis supporting "Most American Adults Get News From Facebook — But They Don’t Really Trust It, A New Survey Says":link::link:
2017-01-18Data and R code to reproduce the graphics in "2016 Was The Hottest Year. Yes, Greenhouse Gases Are To Blame.":link::link:
2016-12-29Data and analysis re. transgender rights survey:link::link::link:
2016-12-20Data and code to reproduce the graphics from "2016 Will Be The Warmest Year, But This Is How Deniers Will Spin It":link::link:
2016-12-07Data, methodologies, and analyses supporting "Intake":link::link::link::link:
2016-12-06Data and analysis supporting "Most Americans Who See Fake News Believe It, New Survey Says":link::link:
2016-11-28Data and code supporting evaluation of forecasters' 2016 election forecast:link::link:
2016-11-07Data and analysis supporting "How The Electoral College Screws Hispanic And Asian Voters":link::link:
2016-11-03Analysis of "bellwether" counties in U.S. presidential elections:link::link:
2016-10-27Data and analysis supporting "Clinton Receives Thirty Times As Much Tech Cash As Trump":link::link:
2016-10-20Data and analysis supporting "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alarming Rate":link::link:
2016-10-09Data and analysis re. White ancestry and Trump support:link::link:
2016-09-16Code supporting "Why 'Shy Trumpers' Probably Won't Decide The Election":link::link:
2016-09-08Data and analysis supporting "When Detectives Dismiss Rape Reports Before Investigating Them":link::link:
2016-08-22Data and analysis supporting "How Katie Ledecky Stacks Up Against Male Swimmers":link::link:
2016-07-30Data and code supporting "Why Track-And-Field Stars Don’t Set World Records Like They Used To (But Swimmers Do)":link::link:
2016-07-24Data supporting "The Republican Convention Was Secretly Watched From Above" and "Government Spy Planes Circled Over The Democratic Convention More Intensely Than GOP Event":link::link: :link:
2016-05-12Analysis of H-2 debarments, violations, and certifications ("The Pushovers"):link::link:
2016-04-26Analysis of GOP donor movements post-Bush and post-Rubio:link::link:
2016-04-20Analysis of Bernie Sanders's ActBlue donors:link::link:
2016-04-06Data and code for "Spies In The Skies":link::link:
2016-02-01Bush/Rubio/Cruz donor movement analysis:link::link:
2016-01-29Data and code for "America's Quiet Crackdown On Indian Immigrants":link::link:
2016-01-26Analysis of Jefferson County (TX) jail data:link::link:
2016-01-26Analysis of criminal case dispositions in Texas municipal courts:link::link:
2016-01-17Methodology and code for "The Tennis Racket":link::link:
2015-12-29Data and analysis for "The Coyote":link::link:
2015-12-09How long will the Warriors' win streak last?:link::link:
2015-12-07Race and fatal police shootings:link::link:
2015-12-02Time elapsed between mass shootings in the U.S.:link::link:
2015-12-01H-2 visa certifications and experience requirements:link::link:
2015-11-24Simulated lottery odds:link::link:
2015-11-19Refugee arrivals in the United States:link::link:
2015-10-16Data and analysis re. Scott Walker's donors post-dropout:link::link:
2015-08-25Data and analysis re. immigrant detention rates:link::link:
2015-07-24Data and analysis re. H-2 visa certifications and enforcement:link::link:
2015-07-07Data and analysis re. the use of primates in biodefense research:link::link:
2015-06-04Data and analysis of BuzzFeed/Ipsos poll on same-sex marriage and abortion views:link::link: :link:
2015-05-03Analyzing #talkpay tweets:link::link:
2015-03-06Analyzing state-by-state changes in earthquake frequency:link::link:
2015-02-20Analyzing deficiencies among Texas foster care child placing agencies:link::link:
2015-02-20Analyzing performance scores of Georgia child placing agencies:link::link:
2014-10-19Debunking the Obama-pronoun myth — data and code:link::link:
2014-09-05Detecting Sunday morning show guests whose "stars are rising" — data and code:link::link:
2014-09-04Comparing college costs to minimum-wage earnings — data, sourcing notes, and analysis:link::link:
2014-08-20Quantifying racial segregation in St. Louis County — code:link::link:
2014-08-13NBA owners' winning percentages — data:link::link:
2014-08-07FTC complaints re. IRS impersonators — data and analysis:link::link:
2014-06-30Firework-related injuries — data:link::link:
2014-06-16Mapping the gender divide in bikeshare programs — data and code:link::link:

Standalone Datasets

RepoDescription
trumpworldData from TrumpWorld
presidential-campaign-contributionsContributions, transfers, and refunds from recent U.S. presidential candidates' principal campaign committees.
nics-firearm-background-checksMonthly data from the FBI's National Instant Criminal Background Check System, converted from PDF to CSV.
H-2-certification-dataH-2 visa certification data & data-standardization.
opm-federal-employment-data40+ years of federal employment data from the Office of Personnel Management

Libraries and Tools

RepoDescription
whtranscriptsFetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts.
bikesharesStandardized parsers for data published by bicycle-sharing programs. Currently supporting: NYC's Citi Bike, Chicago's Divvy, and Boston's Hubway.
twickTwitter, quick. Fetch and store tweets on short notice.

Guides

RepoDescription
zika-dataData — and pointers to data — related to the 2015–16 Zika virus outbreak.
bikeshare-data-sourcesGuide for getting trip history and station data from various bicycle-sharing programs.