{rfName}
Th

Indexed in

Citations

6

Altmetrics

Analysis of institutional authors

Ortiz-Martinez, DCorresponding Author

Share

June 15, 2024
Publications
>
Article
No

The scaling problem in the pattern recognition approach to machine translation

Publicated to: PATTERN RECOGNITION LETTERS. 29 (8): 1145-1153 - 2008-06-01 29(8), DOI: 10.1016/j.patrec.2007.10.001

Authors:

Ortiz-Martinez, D; Garcia-Varea, I; Casacuberta, F
[+]

Affiliations

Univ Politecn Valencia, Dept Sistemas Informat & Computacio, Valencia, Spain - Author
Univ Politecn Valencia, Dept Sistemas Informat, Valencia, Spain - Author

Abstract

Statistical machine translation (SMT) has proven to be an interesting pattern recognition framework for automatically building machine translations systems from available parallel corpora. In the last few years, research in SMT has been characterized by two significant advances. First, the popularization of the so called phrase-based statistical translation models, which allows to incorporate local contextual information to the translation models. Second, the availability of larger and larger parallel corpora, which are composed of millions of sentence pairs, and tens of millions of running words. Since phrase-based models basically consists in statistical dictionaries of phrase pairs, their estimation from very large corpora is a very costly task that yields a huge number of parameters which are to be stored in memory. The handling of millions of model parameters and a similar number of training samples have become a bottleneck in the field of SMT, as well as in other well-known pattern recognition tasks such as speech recognition or handwritten recognition, just to name a few. In this paper, we propose a general framework that deals with the scaling problem in SMT without introducing significant time overhead by means of the combination of different scaling techniques. This new framework is based on the use of counts instead of probabilities, and on the concept of cache memory. (C) 2007 Elsevier B.V. All rights reserved.
[+]

Keywords

Large-scale pattern recognitionMachine translationPhrase-based translationSearch/decoding algorithmStatistical machine translationStatistical pattern recognition

Quality index

Bibliometric impact. Analysis of the contribution and dissemination channel

The work has been published in the journal PATTERN RECOGNITION LETTERS due to its progression and the good impact it has achieved in recent years, according to the agency Scopus (SJR), it has become a reference in its field. In the year of publication of the work, 2008, it was in position , thus managing to position itself as a Q1 (Primer Cuartil), in the category Computer Vision and Pattern Recognition. Notably, the journal is positioned above the 90th percentile.

Independientemente del impacto esperado determinado por el canal de difusión, es importante destacar el impacto real observado de la propia aportación.

Según las diferentes agencias de indexación, el número de citas acumuladas por esta publicación hasta la fecha 2026-04-03:

  • WoS: 3
[+]

Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2026-04-03:

  • The use, from an academic perspective evidenced by the Altmetric agency indicator referring to aggregations made by the personal bibliographic manager Mendeley, gives us a total of: 11.
  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, as well as general views, indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future more formal and academic citations. This claim is supported by the result of the "Capture" indicator, which yields a total of: 11 (PlumX).

With a more dissemination-oriented intent and targeting more general audiences, we can observe other more global scores such as:

  • The Total Score from Altmetric: 3.
[+]

Leadership analysis of institutional authors

There is a significant leadership presence as some of the institution’s authors appear as the first or last signer, detailed as follows: First Author (Ortiz Martinez, Daniel) .

the author responsible for correspondence tasks has been Ortiz Martinez, Daniel.

[+]