StatusThe thesis was presented on the 23 May, 2008
Approved by NCAA on the 19 June, 2008
Abstract– 0.79 Mb / in romanian
ThesisCZU 004.912 2.12 Mb / in romanian
This work represents a study accomplished for the automatic analysis of the business letters with the aim to elaborate the technology and necessary instruments in order to create software in Assisted Machine Translation. It is based on the bilingual English- Romanian parallel corpora of business letters (42 300 words), and specialized dictionaries.
The thesis is based on synergy between Linguistics and Informatics, that’s why the issues approach was through analogies, synthesis and correlation of the results.
We have proposed the analysis of Commercial Language at different levels, including morphological, syntactic, semantic, stylistical and vocabulary. We bring arguments of its existence at each level.
This work also includes: a general presentation of the Natural Language Processing where we describe the Supervised Machine Learning method, the Synergy between Linguistics and Informatics, the Romanian studies and realizations in NLP, the architecture and levels of an NLP system, some of syntax (parsing), semantics (λ - calculus), pragmatics, discourse and Machine Translation issues.
We’ve created a generic ontology of business concepts, which can be enlarged by Ontology Editor Protégé.
We examine the parallel English-Romanian corpora of business letters and present the results of their computational analysis. There are suggestions on terminological and ontological analysis of business language; the basic elements and methodology of Machine Assisted Translation of business letters. We have extracted 1831 pairs of terminological units, for the complex terms were being created 43 models of transformational rules of English Noun Part (NP) into Romanian NP.
Using the elaborated computational resources, we’ve created software BusLet Translator. It is based on Object Oriented Programming, Translation Memory and the bilingual Dictionary of business terms.
This research points out the value aspect of the conceptual frame, from which have been obtained new scientific results. The theoretical research it is being used in teaching course of Computational Linguistics. We specify the necessity of subsequent scientific and didactic contributions.