Jenna Kanerva profiilikuva
Jenna
Kanerva
projektitutkija, data-analytiikka
tutkijatohtori, data-analytiikka

Ota yhteyttä

Asiantuntijuusalueet

kieliteknologia
luonnollisen kielen prosessointi
koneoppiminen
korpukset
annotointi

Biografia

I am a doctoral researcher at the Department of Computing, University of Turku. I’m working as a part of the TurkuNLP research group focusing on language technology and natural language processing (NLP) related topics. I got my Master of Science degree in 2014 at the University of Turku (major subject computer science).

Opetus

Starting from the year 2014, I have acted as a responsible/co-responsible person for the Introduction to Language Technology course lectured at the University of Turku each year. In addition to this, I have been lecturing/co-lecturing several courses/lectures related to language technology at the University of Turku, as well as being invited to give lectures as part-time teacher at the Arcada University of Applied Sciences and the University of Tampere (Pori unit). In order to advance as a teacher, I have completed a 25 ECTS study module of university pedagogy within the years 2019-2021.

Tutkimus

My PhD research focuses on the area of language technology, especially being interested in machine learning based methods for Finnish language processing. I also greatly enjoy and respect elementary corpus work after being part of the data collection and annotation effort of several language data resources built for Finnish language at the TurkuNLP group. After building the elementary resources, these datasets are used to develop several language processing tools based on the latest machine learning methods.

Julkaisut

Järjestä:

Semantic search as extractive paraphrase span detection (2024)

Language Resources and Evaluation
Kanerva Jenna, Kitti Hanna, Chang Li-Hsin, Vahtola Teemu, Creutz Mathias, Ginter Filip
(Vertaisarvioitu alkuperäisartikkeli tai data-artikkeli tieteellisessä aikakauslehdessä (A1))

FinGPT: Large Generative Models for a Small Language (2023)

Conference on Empirical Methods in Natural Language Processing
Luukkonen Risto, Komulainen Ville, Luoma Jouni, Eskelinen Anni, Kanerva Jenna, Kupari Hanna-Mari, Ginter Filip, Laippala Veronika, Muennighoff Niklas, Piktus Aleksandra, Wang Thomas, Tazi Nouamane, Scao Le Teven, Wolf Thomas, Suominen Osma, Sairanen Samuli, Merioksa Mikko, Heinonen Jyrki, Vahtola Aija, Antao Samuel, Pyysalo Sampo
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code (2022)

Empirical Methods in Natural Language Processing
Gehrmann S., Bhattacharjee A., Mahendiran A., Wang A., Papangelis A., Madaan A., McMillan-Major A., Shvets A., Upadhyay A., Bohnet B., Yao B., Wilie B., Bhagavatula C., You C., Thomson C., Garbacea C., Wang D., Deutsch D., Xiong D., Jin D., Gkatzia D., Radev D., Clark E., Durmus E., Ladhak F., Ginter F., Winata G.I., Strobelt H., Hayashi H., Novikova J., Kanerva J., Chim J., Zhou J., Clive J., Maynez J., Sedoc J., Juraska J., Dhole K., Chandu K.R., Perez-Beltrachini L., Ribeiro L.F.R., Tunstall L., Zhang L., Pushkarna M., Creutz M., White M., Kale M.S., Eddine M.K., Daheim N., Subramani N., Dusek O., Liang P.P., Ammanamanchi P.S., Zhu Q., Puduppully R., Kriz R., Shahriyar R., Cardenas R., Mahamood S., Osei S., Cahyawijaya S., Štajner S., Montella S., Jolly S., Mille S., Hasan T., Shen T., Adewumi T., Raunak V., Raheja V., Nikolaev V., Tsai V., Jernite Y., Xu Y., Sang Y., Liu Y., Hou Y.
(Vertaisarvioitu artikkeli konferenssijulkaisussa (A4))

Textual Paraphrase Dataset for Deep Language Modelling (2022)

Kanerva Jenna, Ginter Filip, Chang Li-Hsin, Skantsi Valtteri, Kilpeläinen Jemina, Kupari Hanna-Mari, Piirto Aurora, Saarni Jenna, Sevón Maija, Tarkka Otto
(Vertaisarvioitu artikkeli kokoomateoksessa (A3))