Skip to content

Script for creating speech stats prot-speeches.csv #4

Answered by BobBorges
Lauler asked this question in Q&A
Discussion options

You must be logged in to vote

It's in readme/src/generate-markdown.py in the function starting on line 95 count_pages_speeches_words(). We rely on the assumption that a speech has an introduction, and essentially count the introductions as a proxy for speeches.

Counting next attribs will miss speeches contained in a single <u> element (no idea how many that might be) and include text that is not a speech, but was classified as <u> (also not sure of the scale, but we know it happens).

Replies: 2 comments 4 replies

Comment options

You must be logged in to vote
4 replies
@Lauler
Comment options

@MansMeg
Comment options

@Lauler
Comment options

@BobBorges
Comment options

Answer selected by BobBorges
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants