Open Access
Issue
SHS Web Conf.
Volume 191, 2024
9e Congrès Mondial de Linguistique Française
Article number 11004
Number of pages 17
Section Resources and tools for linguistic analysis
DOI https://doi.org/10.1051/shsconf/202419111004
Published online 28 June 2024