ADoBo — Automatic Detection of Borrowings

Results of ADoBo 2021

Results of the ADoBo 2021 shared task can be found in the following publications:

- Participating systems
- Task overview

ADoBo session at IberLEF 2021

We will be holding a session on ADoBo as part of the IberLEF 2021 workshop. The session will take place online on September 21st, from 11:00 to 12:30 (CEST).

More information on the ADoBo session is available here.

What is ADoBo?

ADoBo is the shared task on automatic detection of borrowings. For the first edition of ADoBo, we propose a shared task on detecting direct, unadapted, emerging borrowings in the Spanish press, i.e. detecting lexical borrowings that appear in the Spanish press and have recently been imported into the Spanish language (words like running, smartwatch, influencer or youtuber). Participants will be provided with a corpus of Spanish newswire annotated with lexical borrowings, and will have to create systems that automatically detect lexical borrowings in Spanish newswire.

[What is a lexical borrowing? Why is borrowing detection an interesting task?]

The task will run from February 2021 to June 2021 and is part of IberLEF 2021, which will take place in September 2021 in Spain.

Dataset

A corpus of Spanish newswire will be distributed among participants. The articles will be annotated with direct, unadapted, emerging lexical borrowings, i.e. lexical borrowings that have been imported into the Spanish language and have not yet been assimilated (words such as look, hype, cliffhanger or lawfare). Annotation guidelines can be found here.

Borrowings will be annotated with BIO labels using two possible categories: ENG for English borrowings and OTHER for lexical borrowings from other languages (tokens that are not part of a borrowing will have the tag O). Only unadapted lexical borrowings will be considered. This means that borrowings that have already undergone orthographic or morphological adaptation (such as fútbol or hackear) will not be labeled as borrowings.

El O
escándalo O
de O
las O
tarjetas O
' O
black B-ENG
' O

Participants will be provided with annotated versions of the training and development set, and an unannotated test set. Participants are expected to submit the annotated test set produced by their system.

A sample set has been made available so that participants can get familiar with the dataset format.
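For readers who want to load the data programmatically, here is a minimal Python sketch that reads a file in the format shown above (one token and its tag per line, blank lines separating sentences). File and variable names are illustrative, not part of the official tooling.

```python
def read_conll(path):
    """Read a CoNLL-style file into a list of sentences of (token, tag) pairs."""
    sentences, current = [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:  # a blank line ends the current sentence
                if current:
                    sentences.append(current)
                    current = []
                continue
            fields = line.split()
            current.append((fields[0], fields[-1]))  # token, BIO tag
    if current:  # last sentence if the file has no trailing blank line
        sentences.append(current)
    return sentences

# Example usage: count annotated borrowing spans in the sample set
sentences = read_conll("sample.conll")  # hypothetical file name
spans = sum(tag.startswith("B-") for sent in sentences for _, tag in sent)
print(f"{len(sentences)} sentences, {spans} borrowing spans")
```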

Evaluation

Submissions will be evaluated using standard precision, recall and F1 over spans, where a true positive is a predicted span that exactly matches a gold span in both its boundaries and its category:
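Precision = TP / (TP + FP)

Recall = TP / (TP + FN)

F1 = 2 × Precision × Recall / (Precision + Recall)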

F1 will be used as the official evaluation score and will determine the final ranking of the participating teams.

It should be noted that the evaluation for the final ranking will be done exclusively at the span level. This means that only full matches will be considered and no credit will be given to partial matches, i.e. given the multitoken borrowing *late night*, the entire phrase has to be correctly labeled in order to count as a true positive.
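Official scores will be computed with SeqScore (described below); the Python sketch here only illustrates what exact span-level matching means. The decoding details and function names are illustrative, not the official scorer.

```python
def bio_to_spans(tags):
    """Decode a BIO tag sequence into a set of (start, end, label) spans.

    `end` is exclusive. An I- tag without a preceding B- is tolerated and
    treated as the start of a new span, similar to lenient decoders.
    """
    spans, start, label = [], None, None
    for i, tag in enumerate(tags):
        if tag.startswith("B-") or (tag.startswith("I-") and label != tag[2:]):
            if label is not None:
                spans.append((start, i, label))
            start, label = i, tag[2:]
        elif tag == "O":
            if label is not None:
                spans.append((start, i, label))
            start, label = None, None
        # an I- tag that continues the current span needs no action
    if label is not None:
        spans.append((start, len(tags), label))
    return set(spans)

def span_scores(gold_tags, pred_tags):
    """Exact-match precision, recall and F1 over spans for one sentence pair."""
    gold, pred = bio_to_spans(gold_tags), bio_to_spans(pred_tags)
    tp = len(gold & pred)  # only full matches count
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Labeling only the first token of "late night" yields no true positive:
gold = ["O", "B-ENG", "I-ENG", "O"]
pred = ["O", "B-ENG", "O", "O"]
print(span_scores(gold, pred))  # (0.0, 0.0, 0.0)
```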

Scoring your output

The precision, recall, and F1 metrics will be computed using SeqScore, a new Python package for sequence labeling evaluation. It works similarly to the classic conlleval.pl scorer and the seqeval package in that it decodes BIO labels in a way that tolerates some improper sequences (for example, starting an entity with I- instead of B-).

To evaluate your output, simply install (or upgrade) SeqScore as follows: pip install -U seqscore. Once it is installed, the seqscore command is available from the command line, since the main script is installed into your Python environment.

If the reference (annotation) is located at dev.conll and your predictions are located at output.conll, you would simply run seqscore score output.conll --reference dev.conll.

The output will look like this:

| Type   |   Precision |   Recall |     F1 |   Reference |   Predicted |
|--------|-------------|----------|--------|-------------|-------------|
| ALL    |      100.00 |   100.00 | 100.00 |           7 |           7 |
| ENG    |      100.00 |   100.00 | 100.00 |           7 |           7 |

You can change the format of the output by using the --display flag, for example --display delim will get you a tab-delimited version of that table (you can configure the delimiter using the --delim flag).

Scoring can only be performed on BIO-encoded output. However, as we continue to prepare tools for the task, we will provide scripts that can convert back and forth between BIO and other formats (such as BIOES/BILOU).
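In the meantime, the standard BIO-to-BIOES mapping is straightforward. The sketch below is an unofficial illustration under that standard mapping, not the script the organizers will release.

```python
def bio_to_bioes(tags):
    """Convert a BIO tag sequence to BIOES (equivalent to BILOU with different letters).

    Single-token spans become S-, the last token of a multi-token span
    becomes E-; O tags are left untouched.
    """
    bioes = []
    for i, tag in enumerate(tags):
        if tag == "O":
            bioes.append(tag)
            continue
        label = tag[2:]
        next_tag = tags[i + 1] if i + 1 < len(tags) else "O"
        continues = next_tag == f"I-{label}"  # does the span go on after this token?
        if tag.startswith("B-"):
            bioes.append(f"B-{label}" if continues else f"S-{label}")
        else:  # I- tag
            bioes.append(f"I-{label}" if continues else f"E-{label}")
    return bioes

print(bio_to_bioes(["O", "B-ENG", "I-ENG", "O", "B-OTHER"]))
# ['O', 'B-ENG', 'E-ENG', 'O', 'S-OTHER']
```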

SeqScore is being continuously updated, so you can always run pip install -U seqscore to get the latest.

Schedule

Results

| Rank | Team        | Tag   | Precision | Recall | F1    |
|------|-------------|-------|-----------|--------|-------|
| 1    | marrouviere | ALL   | 88.81     | 81.56  | 85.03 |
|      |             | ENG   | 90.70     | 82.65  | 86.49 |
|      |             | OTHER | 47.06     | 52.17  | 49.48 |
| 2    | marrouviere | ALL   | 89.40     | 66.30  | 76.14 |
|      |             | ENG   | 90.98     | 67.55  | 77.54 |
|      |             | OTHER | 45.45     | 32.61  | 37.97 |
| 3    | marrouviere | ALL   | 92.28     | 61.40  | 73.75 |
|      |             | ENG   | 93.43     | 63.12  | 75.34 |
|      |             | OTHER | 38.89     | 15.22  | 21.88 |
| 4    | versae      | ALL   | 62.76     | 46.30  | 53.29 |
|      |             | ENG   | 62.97     | 47.62  | 54.23 |
|      |             | OTHER | 45.45     | 10.87  | 17.54 |
| 5    | mgrafu      | ALL   | 65.15     | 37.82  | 47.86 |
|      |             | ENG   | 65.31     | 38.90  | 48.76 |
|      |             | OTHER | 50.00     | 8.69   | 14.81 |
| 6    | Neakail     | ALL   | 75.27     | 27.47  | 40.25 |
|      |             | ENG   | 75.43     | 28.25  | 41.10 |
|      |             | OTHER | 60.00     | 6.52   | 11.76 |
| 7    | Neakail     | ALL   | 76.29     | 25.29  | 37.99 |
|      |             | ENG   | 76.48     | 25.99  | 38.80 |
|      |             | OTHER | 60.00     | 6.52   | 11.76 |
| 8    | Neakail     | ALL   | 76.44     | 24.75  | 37.39 |
|      |             | ENG   | 76.64     | 25.42  | 38.18 |
|      |             | OTHER | 60.00     | 6.52   | 11.76 |

How to participate?

The ADoBo shared task is aimed at participants working on code-mixed data and at those interested in the intersection of neology, lexicography and NLP. Dataset distribution and submissions will be managed via CodaLab.

Participants (and anyone interested in the topic of borrowing detection) are welcome to join the Google group at adobo-task[@]googlegroups.com.

Organization Committee

Contact