Analyse automatique du grec ancien par réseau de neurones. Évaluation sur le corpus De Thessalonica Capta
DOI:
https://doi.org/10.14428/babelao.vol1011.2022.65073Keywords:
Natural Language Processing (NLP), Lemmatisation, POS-tagging, Ancient Greek, John Anagnostes, Eusthatios of Thessalonike, John KaminiatesAbstract
The DTC corpus brings together historical texts written in Greek during the Byzantine period. These texts were analyzed semi-automatically (lemmatization and POS-tagging) by using computer tools and linguistic resources of the GREgORI project (UCLouvain, Louvain-la-Neuve, Belgium) specialized in the NLP of Greek and the languages of the Christian East. A second analysis was carried out in collaboration with the company Calfa (Paris, France) developping NLP tools for Armenian and implementing approach relating to artificial intelligence. This second analysis is performed by a neural network. This study compares and evaluates the results produced by the two methods and proposes a hybrid approach for the processing of the languages concerned.
Downloads
Published
How to Cite
Issue
Section
License
These papers are licensed under a Creative Commons Attribution - Non Commercial - No Derivatives 4.0 International License.
Consequently, the readers are authorised to Share (copy and redistribute the material in any medium or format) under the following terms :
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use ;
- NonCommercial — You may not use the material for commercial purposes ;
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.