Lewis John McGibbney created JOSHUA-270:
-------------------------------------------
Summary: pipeline.pl needs major refactoring
Key: JOSHUA-270
URL: https://issues.apache.org/jira/browse/JOSHUA-270
Project: Joshua
Issue Type: Bug
Components: pipeline
Affects Versions: 6.0.5
Reporter: Lewis John McGibbney
Fix For: 6.1
Right now
[pipeline.pl|https://github.com/apache/incubator-joshua/blob/master/scripts/training/pipeline.pl]
is well over 2000 lines long and extremely difficult to navigate.
I propose the following
* All ENV is refactored into an pipeline_environment file
* All Command line parsing and definitions are refactored into a pipeline_cli
file
* Sanity checking is refactored into a pipeline_sanity_check file
* Dependenct Variable Checking is refactored into
pipeline_dependent_variable_setting file
* filter and preprocess corpora is refactored into
pipeline_filter_preprocess_corpora
* pipeline_subsampling becomes a file
* pipeline_alignment becomes a file
* pipeline_parsing becomes a file
* pipeline_thrax becomes a file
* pipeline_tuning becomes a file
* pipeline_testing becomes a file
* pipeline_subreoutines becomes a file
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)