-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.txt
69 lines (46 loc) · 2 KB
/
README.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
==============================================
MosesPy
Version 0.2
Main developer: Pierre Lison
URL: github.com/plison/mosespy
Released under an MIT license
==============================================
INTRODUCTION:
=============
The MosesPy interface is a Python package designed to ease the use of the
Moses toolkit for statistical machine translation. It contains a set of
methods for training, tuning and evaluating SMT models.
The interface relies on 3 third-party tools: the Moses toolkit itself
(and its set of processing scripts), the multi-threaded GIZA (MGIZA++),
and the IRSTLM language modelling toolkit.
MosesPy is released under an MIT license. Note that this license only
applies to the MosesPy package, not the third-party tools!
INSTALL:
=============
In order to compile the code for these libraries, simply run the 'install.py'
script:
$ python -m mosespy.install
Note that the installation script will only work for reasonably standard
configurations. If the script fails, you will have to compile the code
manually (see the instructions on the Moses website and the documentation
for MGIZA++ and IRSTLM for details).
RUN:
====
Once Moses, MGIZA++ and IRSTLM are compiled, you are ready to configure
and run translation experiments. A simple example of experiment is
shown in the file run_experiment.py. Of course, you will need to download
the data to run the experiment:
* the training data is available at
[ http://www.statmt.org/wmt13/training-parallel-nc-v8.tgz ]
* the tuning and testing data are available at
[ http://www.statmt.org/wmt12/dev.tgz ]
As you might notice, this experiment is a replica of the baseline setup
described in http://www.statmt.org/moses/?n=Moses.Baseline, but is
substantially easier to run. The final BLEU score at the end of this
experiment example should be around 23.50.
MORE INFO:
==========
You will soon find more documentation on the MosesPy website:
github.com/plison/mosespy
Stay tuned!