Skip to content

MODA-NYC/Agency-Name-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NYC Agency Name Project

Project Overview

This project aims to create a standardized list of Agency Names* and publish this as a dataset on the NYC Open Data portal. The primary goal is to enhance data legibility and interoperability by providing official, consistently formatted agency names. This will provide a clear canonical source for how to format Agency Names, improving data quality and saving time when joining datasets on the Agency Name field.

This project is being developed by the Data Governance team in the Office of Data and Analytics.

*The word “Agency” is colloquially used to mean a government organization that includes a New York City Agency, a Mayoral Office, or a Commission.

Original Project Plan document: NYC Agency Name Project Code Notebook originally developed as a Google CoLab project: Agency Name Project.ipynb

Objectives

  1. Collect all relevant lists of Agency names from various sources.
  2. Standardize Agency Names: Develop a standardized list of agency names and acronyms.
  3. Possibly publish a “crosswalk” between different common formats of Agency Names (e.g. alphabetized or not alphabetized, with or without acronym).
  4. Publish a Standardized list to the Open Data portal.
  5. Document a process for maintaining this data asset as new agencies are added or removed, or agencies change their names.

Data Sources

Methodology

  1. Data Collection: Aggregate data from the aforementioned sources.
  2. Data Standardization:
  • Create several columns for each agency name in various formats (e.g., with and without acronyms, alphabetized).
  • Deduplicate names to create a list of unique agency names.
  • Manually evaluate outliers.
  1. Classification:
  • Classify each entity as an "Organization" per the DCAT-US-3 standard.
  • Assign "Organization Type" (e.g., City Agency, Mayoral Office, Commission).
  • Add alternate names and legal authorizing authority as needed.
  1. Additional Information:
  • Include fields for the current commissioner, website URL, and other relevant details.
  • Ensure all field names align with DCAT-US-3 standards, adding extensions where necessary.

Field Names and Definitions and Field Values Definitions

See Data Dictionary.

Maintenance Plan

  • Responsibility: The dataset will be maintained by ODA.
  • Update Process: Document and establish a procedure for adding, removing, or modifying organization names in the dataset.

Stakeholder Engagement

  • Internal Collaboration: Engage with NYC government employees for verification and feedback where needed.
  • Community Engagement: Update and involve the BetaNYC community, soliciting feedback on project plans and output formats.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published