Skip to content

Latest commit

 

History

History
90 lines (69 loc) · 3.99 KB

README.md

File metadata and controls

90 lines (69 loc) · 3.99 KB

Norwegian slot and intent detection and dialect identification (NorSID)

Shared task website: https://sites.google.com/view/vardial-2025/shared-tasks

Training data

All training and development data of the xSID-0.6 dataset may be used for training.

In addition, we provide a machine-translated version of the English training corpus to Norwegian, with projected annotations: nb.projectedTrain.conll.fixed. The quality of both the translation and the annotation projection is relatively poor. Participants wishing to improve the annotation projection may find it helpful to use the existing xSID code.

Development data

The development data can be found in norsid_dev.conll.

Example:

# id = 33/8
# text = Kor varmt skal det ver i dag?
# intent = weather/find
# dialect = V
1   Kor     weather/find   O
2   varmt   weather/find   B-weather/attribute
3   skal    weather/find   O
4   det     weather/find   O
5   ver     weather/find   O
6   i       weather/find   B-datetime
7   dag     weather/find   I-datetime
8   ?       weather/find   O
  • The id field contains the sentence id (33) and the translator id (8), separated by /.
    • Sentence ids range from 1 to 300 in the development set. All sentences with the same id are translations of each other, i.e. sentence 78/1 is just another dialectal variant of sentence 78/2.
    • Translator ids range from 1 to 10 as well as B (Bokmål).
  • The text field represents the detokenized prompt string.
  • The intent field contains the name of the intent associated with the prompt.
    • Some intent labels contain / but others don't.
  • The dialect field contains the dialect label (one uppercase character).
    • There are four dialect labels: V, N, T, B (see Table below for details).
  • The numbered lines contain the tokens of the prompt, together with the intent label and the slot annotations.
    • Some slot labels contain / but others don't (e.g. weather/attribute vs. datetime).

The correspondences between dialect areas and translator ids are as follows:

Translator ID Origin Dialect area
1 Tromsø N (North Norwegian)
2 Tromsø area N (North Norwegian)
3 Trondheim T (Trøndersk)
4 Trondheim T (Trøndersk)
5 Sunndal T (Trøndersk)
6 Ålesund area V (West Norwegian)
7 Haugesund V (West Norwegian)
8 Stavanger V (West Norwegian)
9 Stavanger V (West Norwegian)
10 Bergen V (West Norwegian)
B B (Bokmål)

Test data

The test set has been released: /norsid_test_nolabels.conll. It contains the untokenized and tokenized prompt strings as well as placeholders for the intent and dialect labels:

# text = Ka blir temperaturen i dag?
# intent = 
# dialect = 
1	Ka
2	blir
3	temperaturen
4	i
5	dag
6	?

Evaluation

You can find the evaluation scripts in the scripts directory. The official metrics are:

  • slot f1 for the slots
  • intent accuracy for the intents
  • weighted f1-score for the dialects.

If you only participate in the sentence-level tasks (intent detection and/or dialect identification), you can submit a simpler file containing only lines starting with # and blank lines (see example norsid_dev_random.conll).

Results

The official results of the shared task are provided on a different page. The detailed output of the evaluation script is provided in participants/detailed_results.txt. We provide the following baselines:

  • For slot and intent detection: mBERT fine-tuned on the English xSID training data with the MaChAmp toolkit.
  • For dialect detection: an SVM classifier trained on the NorSID development data using tf-idf-weighted character 1-gram to 4-gram features.