Towards Facts Extraction from Texts in the
Polish Language

Tomasz Boiński; Adam Brzeski

Abstrait

Towards Facts Extraction from Texts in the Polish Language

Tomasz Boiński, Adam Brzeski

The Polish language differs from English in many ways. It has more complicated conjugation and declination. Because of that automatic facts extraction from texts is difficult. In this paper we present basic differences between those languages. The paper presents an algorithm for extraction of facts from articles from Polish Wikipedia. The algorithm is based on 7 proposed facts schemes that are searched for in the analyzed text. The analysis includes morphosyntactic tagging, named entity extraction and relation identification. The results acquired for an exemplary Wikipedia text is presented. We indicate the free word formation principle as the main difficulty in the Polish texts analysis. At the same time satisfactory performance of the tagging and analysis tools for the Polish language was confirmed in the conducted experiment.

Avertissement: Ce résumé a été traduit à l'aide d'outils d'intelligence artificielle et n'a pas encore été examiné ni vérifié

Faits saillants de la revue

Adaptatif Algorithmes numériques avancés Architectures informatiques avancées Bioinformatique et biologie computationnelle Calcul en grille Capteurs sans fil Entreposage de données Informatique autonome et contextuelle Logiciels open source Middleware basé sur des agents Protocole de communication CDMA/GSM Réseau ad hoc Réseaux haut débit et intelligents Reconnaissance de modèles/images d’intelligence artificielle Robotique Sécurité de la base de données Structure de données Systèmes de sécurité Technologie calme Technologie radar

Indexé dans

Index Copernicus

Academic Keys

CiteFactor

Cosmos IF

RefSeek

Hamdard University

World Catalogue of Scientific Journals

International Innovative Journal Impact Factor (IIJIF)

International Institute of Organised Research (I2OR)

Cosmos

Revues internationales

Ingénierie Sciences générales Sciences médicales Sciences pharmaceutiques

Revue internationale de recherche innovante en génie informatique et des communications

Abstrait

Towards Facts Extraction from Texts in the Polish Language

Faits saillants de la revue

Indexé dans

Revues internationales

Adresse