swMATH ID: 27292
Software Authors: Eric Breck
Description: zymake: a computational workflow system for machine learning and natural language processing. Experiments in natural language processing and machine learning typically involve running a complicated network of programs to create, process, and evaluate data. Researchers often write one or more UNIX shell scripts to ”glue” together these various pieces, but such scripts are suboptimal for several reasons. Without significant additional work, a script does not handle recovering from failures, it requires keeping track of complicated filenames, and it does not support running processes in parallel. In this paper, we present zymake as a solution to all these problems. zymake scripts look like shell scripts, but have semantics similar to makefiles. Using zymake improves repeatability and scalability of running experiments, and provides a clean, simple interface for assembling components. A zymake script also serves as documentation for the complete workflow. We present a zymake script for a published set of NLP experiments, and demonstrate that it is superior to alternative solutions, including shell scripts and makefiles, while being far simpler to use than scientific grid computing systems.
Homepage: https://dl.acm.org/citation.cfm?id=1622113
Related Software: Orange4WS; Phrasal; Accord.NET; CNTK; MXNet; StanfordCoreNLP; Caffe; MLlib; Dlib-ml; Orange; Moses; MALLET; Scikit; topicmodels; RCV1; WEKA; Torch; LIBSVM; NLTK
Cited in: 1 Document

Cited in 0 Serials

Citations by Year