Skip to content
This repository has been archived by the owner on May 8, 2024. It is now read-only.

Quality assessment of the MP database #243

Closed
fredrik1984 opened this issue Mar 7, 2023 · 10 comments
Closed

Quality assessment of the MP database #243

fredrik1984 opened this issue Mar 7, 2023 · 10 comments
Milestone

Comments

@fredrik1984
Copy link
Collaborator

Our MP database will be an important infrastructure in itself. Hence, the goal should be to have ALL MPs since 1867 in the database, with at least some rudimentary metadata. We need therefore to perform a manual quality assessment of the database and the MPs included (and not) in it.

Important MP metadata categories: name, party, gender, date of birth, i-ort, kammaruppdrag (ordinarie, ersättare, statsrådsersättare), valkrets, utskottsuppdrag/roller (committees, government, speaker of the house), uppropsnummer, chair, occupation.

A first step could be to draw a sample of MPs from the biography series and see how many are missing from our database.

@ninpnin
Copy link
Collaborator

ninpnin commented Mar 13, 2023

This would be very good.

biography series

Is this easily available? I'd like to take a look and see how the sampling could be implemented.

@ninpnin
Copy link
Collaborator

ninpnin commented Mar 13, 2023

Is this and #245 the same thing though?

@fredrik1984
Copy link
Collaborator Author

Yes, it is the same. I don't know if it is on GitHub, but I can send you it through sprend

@MansMeg MansMeg added this to the MP database push milestone Apr 2, 2023
@MansMeg
Copy link
Collaborator

MansMeg commented Apr 2, 2023

I think we should do this by checking all wikidata ids that Emil and Magnus has checked and look in our database that they exist.

@salgo60
Copy link
Contributor

salgo60 commented Apr 4, 2023

@fredrik1984 Maybe related

  • I started a discussion right now on Swedish Wikipedia a discussion about if we can document different sources quality/trust
    • a template I did 2028 about the my understanding of the quality of Riksarkivt SBL
    • my question to Denny the designer of Wikidata and he thoughts of sources and trust video
    • my draft if we can add that to Wikidata T222142 "WikidataCon 2019: We need a better model communicating quality/relevance of sources in Wikidata / Provenance"

image

@salgo60
Copy link
Contributor

salgo60 commented Apr 4, 2023

FYI @fredrik1984 One thing we dont have in Wikidata is school education

e.g. Folkräkningar (Sveriges befolkning) 1930 - Folk_128505791 is about Q16649279 and he has (folkskolans påbyggnader; lägre specialutbildning)

I have seen that SCB publish aggregated data about education for Swedish PM and I have asked for getting it per person with no success - created #102

I will send a question to Swedish Riksarkivet if they have an API and any plans on curating the eduction data...

image

Occupations

In Wikidata its a small chaos but we try to have WD objects for occupations.... I did an overview some Swedish sources (Alvin, SKBL SBL...) usage of Occupation see salgo60/HISCOKoder and tried started an dialogue with no success 2021

@fredrik1984
Copy link
Collaborator Author

Hm, I think school education is the Tvåkammar book, no? See this attached example
Skärmavbild 2023-04-04 kl  14 09 34

@salgo60
Copy link
Contributor

salgo60 commented Apr 4, 2023

@fredrik1984 great and we sometimes add Alma Mater P69 in WD see SPARQL Swedish PM / grouped = 209 different / Map quality unknown

image

image

Uppsala Universitet I guess we should not just have schools also education/skills

I think WD has Alma Mater Uppsala Universitet but I guess it would be more interesting to understand the skill matrix of people in the Swedish parlament and what exams they have... my feeling one reason we dont have better Riksdagens Öppna data is the lack of IT skills in the Swedish PM and they cant understand why they need data as data and a SPARQL endpoint #62 / consequencies of not having 5 stardata #71 / dont see the possibilities connecting EU <-> Kommuner #98....

image

Riksarkivet Folkräkningar (Sveriges befolkning) 1930

The benefits with Riksarkivet is that they have it structured and maybe cleaner 🙏 and we maybe get better trust when data same as Riksarkivet

Feedback Riksarkivet: its in the pipe but looks like they have nothing to show yet...

OT: I did a try with Bildhistoria to map all Swedish Schools but gave up see #1 ... see comment Johannes Westberg that its a rather tough challenge ;-)

@fredrik1984
Copy link
Collaborator Author

Ah, cool! It will be cool to get education etc in the Swerik corpus, although it is not the highest priority for now.

@salgo60
Copy link
Contributor

salgo60 commented Apr 4, 2023

I like the metaphor Google presented at "Applied semantics: beyond the catalog" at 12.29 min that Linked data means that we dont start from scratch all the time instead we use what we find and tries to add value... no one if we do this right needs to encode one more time that Andersson/Olin studied at Harvard... or who studied in the States

image

image

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants