Skip to content

This is the repository for the Urban Data Science & Equitable Cities Working Group (Fall 2024), an EEAMO Bridges working group.

Notifications You must be signed in to change notification settings

mattwfranchi/EAAMO-Bridges-UDS-F24

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

EAAMO-Bridges-UDS-F24

This is the repository for the Urban Data Science & Equitable Cities Working Group (Fall 2024), an EEAMO Bridges working group.

By 2050, the UN projects that 68% of the population will live in a city. Cities are an increasingly important aspect of daily life, affecting long-term health and access to job opportunities. The use of data to guide decision-making, improve the responsiveness of local government, and reduce inequality has become critical. With the advent of the open data movement for cities, the volume of urban data has exploded. In addition to officially-published open datasets, unique sources of unstructured data, including text and images, are becoming more abundant, providing new opportunities for analyzing urban environments and their occupants. In this EEAMO Bridges working group on urban data science & equity, we host speakers, study papers, and workshop late-stage work relating to the sophisticated computational analysis of urban data, emphasizing topics that both explore and address inequities in urban life. This group began meeting in Summer 2024 as a Cornell reading group on urban data science, incorporating members from information science, computer science, urban planning, sociology, and public policy.

We meet every other week for a presentation from an invited speaker or group member, followed by sustained discussion.

Program

Schedule:

Wednesday, September 25 2024

Wednesday, October 9 2024

  • Guest speakers: Zhi Liu and Sidhika Balachandar
    • Presentation title: Models to Correct Under-Reporting in Urban Crowdsourcing Systems
    • Abstract: Decision-makers often observe the occurrence of events through a reporting process. City governments, for example, rely on resident reports to find and then resolve urban infrastructural problems such as fallen street trees, flooded basements, or rat infestations. Without additional assumptions, there is no way to distinguish events that occur but are not reported from events that truly did not occur. Because disparities in reporting rates correlate with resident demographics, addressing incidents only on the basis of reports leads to systematic neglect in neighborhoods that are less likely to report events. We show how to overcome this challenge in three different settings and estimate reporting rates. First, we leverage the fact that events are spatially correlated and propose a latent variable Bayesian model. Second, we propose a method to fit graph neural networks with both resident report and city inspection data. And third, we incorporate the report delay and the possibility of multiple reports in a Poisson Bayesian model. Our work lays the groundwork for more equitable proactive government services, even with disparate reporting behavior.
    • Associated readings:
  • Discussion on biases and uses of resident crowdsourcing data.

Wednesday, October 23 2024

  • Working group activity: Crafting Research Abstracts: Strava dataset
  • Goal: Gauge interest to collaborate on a project using the Strava metro dataset + outline a project proposal
  • Instructions:
    • Read and familiarize yourself with the Strava metro dataset
    • Think of a research project you would like to pursue with the dataset. If possible, focus on areas of common interesting to the group (equity, urban mobility, housing, policy, etc).
    • Please submit a ``dream research abstract'' on a paper using the Strava dataset. That is, write an abstract as if you have published a paper using the dataset.
      • Include clear motivations and problem statements.
      • Outline briefly the methods you would plan to use.
      • Include ``expected results'': ideally, what do you expect to find? This will help us understand whether, assuming the research is successful, the project is worth pursuing.
      • (Optional) Include a small bibliography of related work.
    • We will get in breakout rooms and analyze a few selected abstracts. Working group members are encouraged to select aspects of each proposed project that are of interest.

Wednesday, November 6 2024

  • Guest speaker: AJ Alvero
    • Presentation title: Who Do Large Language Models Write Like? A Sociolinguistic Perspective on AI-Generated Texts
    • Abstract: When large language models (LLMs) generate "human-like text", which humans do they write like? The answer to this question has strong implications for their use as methodological tools across the social sciences, but they also point to new dynamics to consider in important social processes. In this presentation, I will discuss ongoing work that compares the text produced by LLMs with personal statements written by applicants to selective colleges and universities. We use the Linguistic Inquiry and Word Count (LIWC) dictionary to compare stylistic features backed by psychological research and various embedding approaches that are popular in computational social science. Both approaches yielded similar sets of findings: LLM writing style is distinctive from humans but is closer to those with higher social status. Conversely, we also find that LLMs are stylistically narrow but have broad vocabularies. These results helped us formulate two perspectives for future research on the intersection of sociolinguistics, text, and LLMs: the sampling perspective, based on linguistic closeness (in our case, LLMs were closer in their text to higher social status applicants); and the distributional perspective, based on variation in text features (in our case, LLMs had notably distinctive distributions from any group of humans). We hope this work can help spur not just more empirical work but also theoretical work about human language as it compares and interacts with LLMs.
    • Associated readings: Large language models, social demography, and hegemony: comparing authorship in human and synthetic text

Wednesday, November 20 2024

  • Guest speaker: Wonyoung So
    • Presentation title: Reparative Urban Science: Challenging the Myth of Neutrality and Crafting Data-Driven Narratives
    • Abstract: I offer how urban planning should approach technology within the context of systemic racism, advocating for a reparative approach to address the issues of urban technology perpetuating today’s racial inequality and hindering efforts to redress historical oppression. I identify three mechanisms – formalization, context removal and legitimization, and penalization and extraction – that illustrate how urban technology perpetuates historical inequalities, often penalizing marginalized groups under the pretext of neutrality and fairness. Then, I discuss methodologies of reparative urban science, aiming to use urban technology to challenge race-neutral ideologies and create data-driven narratives for reparations.
    • Associated readings: Reparative Urban Science: Challenging the Myth of Neutrality and Crafting Data-Driven Narratives

Wednesday, December 4 2024

  • Guest speakers: Andrea Vallebueno
    • Presentation title: TBD
    • Abstract: TBD
    • Associated readings: TBD

Wednesday, December 18 2024

  • Working group activity: TBD

Invited Speakers:

Organizers:

We welcome involvement in this working group! If you're interested in joining, please reach out to one of the organizers:

  • Jennah Gosciak (jrg377 AT cornell dot edu)
  • Matt Franchi (mwf62 AT cornell dot edu)
  • Gabriel Agostini (gs665 AT cornell dot edu)

About

This is the repository for the Urban Data Science & Equitable Cities Working Group (Fall 2024), an EEAMO Bridges working group.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published