J.DepP - C++ implementation of Japanese Dependency Parsers

developed by Naoki Yoshinaga at Yoshinaga Lab., IIS, University of Tokyo

About

J.DepP is a C++ implementation of Japanese dependency parsing algorithms [1,2,3,4]. It takes a raw sentence as input and performs word segmentation, POS tagging (thanks to MeCab), bunsetsu chunking, and dependency parsing. Syntactic parsers have been believed to be (significantly) slower than front-end part-of-speech taggers, and they are rarely used in industries that need to handle massive texts (e.g., microblogs). The inefficiency of parsers is, however, simply because researchers have paid attention mostly to accuracy and have not seriously pursued efficient implementations. J.DepP is meant for those who want to parse massive texts (e.g., entire blog feeds or microblogs); J.DepP is even faster than most front-end morphological analyzers (parsing >10000 sentences per second), while achieving state-of-the-art parsing accuracy.

If you make use of J.DepP for research or commercial purposes, the (optional) references to cite are:

N. Yoshinaga and M. Kitsuregawa. A Self-adaptive Classifier for Efficient Text-stream Processing. Proc. COLING 2014, pp. 1091--1102. 2014. (used for testing a parser)
N. Yoshinaga and M. Kitsuregawa. Kernel Slicing: Scalable Online Training with Conjunctive Features. Proc. COLING 2010 (oral), pp. 1245--1253. 2010. (used for training a parser)
N. Yoshinaga and M. Kitsuregawa. Polynomial to Linear: Efficient Classification with Conjunctive Features. Proc. EMNLP 2009, pp. 1542--1551. 2009. A longer journal version is also available. (used for testing a parser)

Features

Extremely efficient parsing powered by pecco: J.DepP processes 11737 / 15557 raw sentences per second in news domain / blog feeds on 11-inch MacBook Air (Mid 2011) with 1.8 GHz Intel Core i7 CPU (cf. MeCab analyzes 27378 / 32164 raw sentences per second in news domain / blog feeds).
State-of-the-art parsing accuracy powered by opal [-l 0] (partial: ~92.1% and complete: ~57.7% on news articles): you can train this parser in a minute; training the chunker and dependency parser from the training split of Kyoto University Text Corpus takes just 11.5s and 35.9s on the above MacBook Air, respectively.
NOTE: The accuracy of J.DepP (or other statistical parsers) depends on the quality and quantity of the training corpus, so J.DepP achieves the state-of-the-art accuracy when the compared parsers are trained with the same corpus. By default, J.DepP uses freely-available Kyoto-University and NTT Blog Corpus to train a model, in order to isolate users from issues related to the corpus license. The resulting J.DepP is faster but less accurate than J.DepP trained with Kyoto University Text Corpus.

The best accuracy will be obtained by using a larger corpus and richer features for training:
```
> configure --with-corpus=kyoto+knbc --with-classifier=3rdPolyPMT
> make model
```
If you do not have Kyoto University Text Corpus but need this accurate model for research or personal purposes, e-mail me (see AUTHORS for the address).
Alternative parsing algorithms: shift-reduce (or transition-based) [3] [-p 0], cascaded chunking [2] [-p 1], backward [1] [-p 2], and tournament [4] [-p 3] algorithms are implemented. You may want to stick to the shift-reduce parser [-p 0] (default) for practical purposes, since it is not only the most efficient of all (O(n)) but also the most accurate (partly because features are tuned for this algorithm). Note: this default algorithm [-p 0] does not return probabilities for dependencies whose heads are the final chunks (0 is returned with -v -1). This is because the final chunk is the last remaining head candidate for those chunks; the algorithm thus determines the dependency without running the classifier that outputs scores.
Compatible with major morphological analyzers: JUMAN or MeCab (default); (--with-postagger=juman|mecab). Or you can integrate MeCab into J.DepP to build a stand-alone dependency parser (--enable-standalone). For MeCab, jumandic (default), ipadic, NAIST-jdic, and UniDic are supported (--with-mecab-dict=JUMAN|IPA|NAIST-J|UNI).
Various machine learning algorithms to train a parser: SVM, MaxEnt [5], and online learning.
SWIG-based Perl/Python/Ruby/Lua bindings: you can use J.DepP from your favorite scripting languages, although it may undermine the virtue of J.DepP (efficiency); follow the instructions under swig/.
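
The note on the shift-reduce algorithm [-p 0] above can be illustrated with a minimal Python sketch of a linear-time, stack-based parse in the style of [3]. This is not J.DepP's actual code: `parse_shift_reduce` and the `depends` callback (standing in for the trained classifier) are hypothetical names. The sketch shows why every chunk still waiting on the stack is attached to the final chunk without consulting the classifier, so no score is available for those dependencies.

```python
from typing import Callable, List


def parse_shift_reduce(n: int, depends: Callable[[int, int], bool]) -> List[int]:
    """Return head[i] for chunks 0..n-1 (the head of the final chunk is -1).

    `depends(i, j)` is a hypothetical stand-in for the classifier that
    decides whether chunk i modifies chunk j (i < j).
    """
    heads = [-1] * n
    stack: List[int] = []
    for j in range(n):
        is_last = j == n - 1
        # Pop chunks whose head is j. For the final chunk the classifier is
        # skipped: everything left on the stack must depend on it.
        while stack and (is_last or depends(stack[-1], j)):
            heads[stack.pop()] = j
        stack.append(j)
    return heads


# Toy "classifier" that looks up a made-up gold tree and records each call.
gold = [1, 3, 3, -1]
calls = []


def toy_depends(i: int, j: int) -> bool:
    calls.append((i, j))
    return gold[i] == j


heads = parse_shift_reduce(len(gold), toy_depends)
print(heads)       # [1, 3, 3, -1]
print(len(calls))  # 2
```

Each chunk is pushed once and popped at most once, so the number of classifier calls grows linearly with the number of chunks.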

License: GNU GPLv2, LGPLv2.1, and BSD; or e-mail me for other licenses you want.

Download & Setup

> wget http://www.tkl.iis.u-tokyo.ac.jp/~ynaga/jdepp/jdepp-latest.tar.gz
> tar zxvf jdepp-latest.tar.gz
> cd jdepp-YYYY-MM-DD

# 1) train a parser with Kyoto-University and NTT Blog (KNB) Corpus (default)
> configure

# or train a parser with Kyoto University Text Corpus (KyotoCorpus4.0 required)
> configure --with-corpus=kyoto
> ln -s PATH_TO_KYOTO_CORPUS/KyotoCorpus4.0

# or train a parser with Kyoto University Text Corpus (KyotoCorpus4.0 required)
#  and KNB Corpus (CaboCha seems to use these corpora for training)
> configure --with-corpus=kyoto+knbc
> ln -s PATH_TO_KYOTO_CORPUS/KyotoCorpus4.0

# or train a parser with Kyoto University Text Corpus w/o Mainichi news articles
# Caveats: this option changes the feature set, so models trained w/o this option
#          are not compatible with the resulting jdepp binary
> configure --with-corpus=kyoto-partial --disable-autopos-train

# 2) make model using the entire corpus (for slightly better accuracy)
# Caveats: make calls scripts in tools/, which require python3 (J.DepP 2022-03-18 or later)
> make model && make install

# or make model using a part (standard training split, if any) of the corpus,
#   if you want to know the accuracy of the installed parser
> make model-benchmark && make install

# See usage:training for other configuration options in building a model.

## (optional) MaxEnt [-l 2] requires Tsuruoka's MaxEnt implementation.
> wget http://www.logos.ic.i.u-tokyo.ac.jp/~tsuruoka/maxent/maxent-3.0.tar.gz
> tar zxvf maxent-3.0.tar.gz
> cd src && ln -s ../maxent-X.Y/*.{cpp,h} . && cd ..
> configure --enable-maxent

For Mac OS X users: try port jdepp via MacPorts (special thanks to @hjym_u); it will build a standalone parser (--enable-standalone) using KNB corpus (--with-corpus=knbc) with auto POSs given by MeCab/jumandic (default).

Requirement

OS: 32/64-bit UNIX-compatible OS (tested on Linux / Mac OS X)
Compiler: tested with GNU gcc (≥ 4.0) or clang (≥ 2.9)
Library (optional): MeCab or JUMAN as a front-end morphological analyzer (if you want to evaluate only bunsetsu chunking or dependency parsing, configure --disable-autopos-train). Darts (by Taku Kudo) or darts-clone (by Susumu Yata; recommended) can be used to store a feature dictionary. TinySVM or Tsuruoka's MaxEnt Estimator can be used to train an SVM / MaxEnt classifier.
Data (optional): Kyoto University Text Corpus or a corpus in the same format for training a parser (note: J.DepP tries to learn a model from automatically downloaded Kyoto-University and NTT Blog corpus since version December 25th, 2013; before that, you need to configure J.DepP with --with-corpus=knbc).

ToDo

Support compressed dependency database output.
Native C++ implementation of to_chunk.py or to_tree.py.
Implement an efficient word segmenter / POS tagger.
Provide pre-trained models (e-mail me if you need).

History

March 18th, 2022 (xz) (development; minor bug/typo fixes and docs only):
- Support Python3 instead of Python (now you can compile J.DepP on MacOS Monterey 12.3)
- Fix a bug in parse_from_postagged () and read_result () for SWIG bindings.
January 23rd, 2022 (xz):
- Change an option of curl from -O to -LO to follow redirect in retrieving KNBC corpus (thanks: Ahmed Fasih).
- Fix a bug that causes segmentation fault when giving a large input with size close to the IOBUF_SIZE (thanks: Kay).
- Fix a bug in building a feature dictionary for inflections of words only with major POSs (auxiliary verbs) in MeCab unidic (a little impact on the accuracy).
August 16th, 2018 (xz):
- Fix a bug in using to_chunk.py with mecab-unidic (outputs include empty fields) (thanks: Ahmed Fasih).
April 10th, 2018 (patch only):
- This is a patch to enable a hidden feature of stacking J.DepP over another dependency parser (originally implemented around 2011). When J.DepP is trained with this patch, its accuracy on Kyoto University Text Corpus 4.0 improves from 92.09% to 92.92% (default parameters, stacking knp-4.16 with juman-7.0) or 93.19% (with 3rd-order polynomial kernel, stacking knp-2.0 with juman-5.1).
- NOTE: The resulting parser loses the virtue of J.DepP (speed). Practitioners may want to use a technique called structure compilation [Liang+ ICML 2008] or knowledge distillation to approximate the stacked parser with pure J.DepP to recover parsing speed while keeping the achieved accuracy.
October 5th, 2015 (xz):
- Change License from GNU GPL/LGPL to GNU GPL/LGPL and 2-clause BSD license.
February 8th, 2015 (xz):
- Apply the patch to fix a bug in parse_tostr () and parse_from_postagged_tostr ().
June 22nd, 2014 (xz):
- Support training with universal dependency treebank (configure --with-corpus=universal; experimental)
- Add SWIG-based Perl/Python/Ruby/Lua bindings (experimental).
- Add a configuration option to choose classifiers (configure --with-classifier=Linear|2ndPolyFST|3rdPolyPMT).
  - speed: Linear >> 2ndPolyFST > 3rdPolyPMT
  - accuracy: Linear << 2ndPolyFST ≤ 3rdPolyPMT.
- There is a bug in parse_tostr () and parse_from_postagged_tostr () (thanks to Dr. Shinzato).
  Apply this workaround patch to J.DepP to fix this.
December 25th, 2013 (xz):
- Package with the latest version of cedar.
- Change a default training corpus to Kyoto-University and NTT Blog Corpus (configure --with-corpus=kyoto to train with Kyoto University Text Corpus).
- Support the latest version of UniDic (2.1.2) (partial: ~91.7% and complete: 56.2% on news articles).
- Speed up to_chunk.py (x1.5) and to_tree.py (x1.2).
- Support training with both Kyoto University Text Corpus and Kyoto-University and NTT Blog Corpus (configure --with-corpus=kyoto+knbc).
- Add a command-line option [-i STR] to jdepp, to_chunk.py and to_tree.py, to ignore lines starting with STR.
- make model now takes advantage of all the corpus for training (improve the accuracy slightly).
- make model-benchmark performs training and testing with standard splitting (same as make model in the previous versions).
- Fix a configuration error in clang (Mac OS X Mavericks).
January 23rd, 2013 (xz):
- Package with the latest version of opal & pecco; a bug fix when configuring --enable-float.
- Under certain x86-64 systems with broken std::strncmp(), J.DepP fails to parse (morphologically analyzed) sentences with morphemes starting with 'E' (thanks to Dr. Ohtake).
  Apply this workaround patch provided by the bug reporter to J.DepP to fix this.
```
> cd jdepp-2013-01-23
> wget http://www.tkl.iis.u-tokyo.ac.jp/~ynaga/jdepp/jdepp.patch
> patch -p1 < jdepp.patch
```
December 12th, 2012 (xz):
- Package with the latest version of opal & pecco.
- Remove most of the static variables (verified with clang++ -Weverything).
November 8th, 2012 (xz):
- Support UniDic as a morphological dictionary (configure --with-mecab-dict=UNI) (experimental).
- Revise replace_pos.py to make corpus generation robust and 4x faster.
- Tune default hyper-parameters to train/test chunker/parser.
- Fix a bug in parsing a sentence with 1-byte characters (thanks to Mr. Chungen Li).
October 30th, 2012 (xz):
- Support training with KyotoCorpus4.0 annotation w/o Mainichi news articles (configure --with-corpus=kyoto-partial).
- Clean up code for generating features; sweep away if-else, and place assertions to guarantee valid feature indices.
- Add workaround features to capture local coordination (+ feature engineering, partial accuracy +0.2%; complete accuracy +0.5%).
- Add quiet option [-q] to to_chunk.py and to_tree.py; display only incorrectly chunked/parsed sentences.
- Fix a bug in generating features for chunking [-t 1] (for a sentence with fewer than three morphemes).
- Change color scheme in to_chunk.py and to_tree.py; prettier printing.
October 6th, 2012 (xz):
- Support a probability output for [-l 0,1] (sigmoid fitting is done by opal & pecco).
September 26th, 2012 (xz):
- Support a probability output [-v -1] for [-I 0] (for [-l 0,1], the standard sigmoid function is used to normalize a margin).
- Support half-width characters in displaying a dependency structure (tools/to_tree.py).
- Exclude wrongly included MaxEnt code (from the previous distributions as well).
- Add a workaround for a gcc bug in value initialization.
September 11th, 2012 (xz):
- Package with the latest version of opal & pecco (x1.5-2 speed up in parsing).
- Add opal option [-p] in training (x1.5 speed-up in training).
May 17th, 2012:
- Support GNU autoconf/automake (thanks to Dr. Shinzato).
- Change license from GNU GPL to GNU GPL/LGPL.
- Support training with Kyoto-University and NTT Blog Corpus.
- Support easy-going training/testing with major training corpora (configure --with-corpus=kyoto|knbc && make model).
- Support NAIST-jdic as a morphological dictionary (configure --with-mecab-dict=NAIST-J).
- Support installation of a trained model.
- Support a probability output for [-I 1,2].
- Support human-readable chunk outputs (to_chunk.py)
- Change default character encoding from EUC-JP to UTF-8.
- Improve the accuracy and speed of the chunker/parser by backward-incompatible feature engineering.
- Add a command-line option to specify input encoding (UTF-8 or EUC-JP).
- Remove USE_EMNLP_FEAT compilation flag.
Jan 4th, 2012:
- Rewrite interface with GNU Getopt.
- Support a stand-alone mode (integrate MeCab into J.DepP).
- Support both JUMAN and MeCab as a front-end morphological analyzer
- Support human-readable tree outputs (to_tree.py)
- Support training with auto POSs (replace_pos.py generates training data with auto POSs from the one with gold POSs).
- Support automatic model identification; now you don't need to provide the parameters used to train the model in testing.
- Support all configurable options of the learners/classifiers (delegated to individual libraries as is).
- Simplify code for building a string-to-feature-to-ID dictionary (different feature indexing scheme); this may affect the model accuracy (±.03% level).
- Reduce overhead in feature extraction using bit twiddling.
August 29th, 2011:
- Support the latest version of Tsuruoka's MaxEnt Estimator [-meopt=SGD|OWLQN|LBFGS].
- Support command-line configuration of opal options for jdepp [-oopt=P|PA1|PA2, -oave=0|1].
- Support command-line configuration of pecco options for cbun (provide them after '--' in command-line).
- Remove intricacy in Makefile (you can make install).
- Fix a bug in per-sentence parsing from STDIN (thanks to Dr. Kazama).
June 15th, 2011:
- Support IPA POS tag set (experimental).
March 7th, 2011:
- Add a cascaded chunking model [-p chunking] (experimental).
- Optimize the code of the backward model
January 27th, 2011:
- Add a simple bunsetsu chunker (IN: MeCab; OUT: KNP).
- Fix a bug in building a morph dict on Linux (thanks to Dr. Kazama)
- Change default parameters and option names
January 10th, 2011:
- Elaborate features in the tournament model (dependency accuracy > 92%).
September 16th, 2010:
- Follow a change in APIs of opal.
August 1st, 2010:
- Fix a bug in building a feature dictionary.
- Support the double array implementation (cedar.h dda.h) included in opal.
July 27th, 2010:
- Change the argument order (default parameters as well) given to opal.
July 21st, 2010:
- Resolve memory leak.
- Support opal for training.
May 16th, 2010:
- Fix a bug in feature naming (increase MAGIC to avoid a naming collision among features in tournament model).
December 1st, 2009:
- initial release.

Usage

Typing ./jdepp -h shows the following usage information.

J.DepP - Japanese Dependency Parser
Copyright (c) 2008-2012 Naoki Yoshinaga

Usage: jdepp [options] -- [learner options] -- [chunker classifier options] -- [parser classifier options] < test

test    test file

Optional parameters in training / testing:
  -t, --type=TYPE             select running mode of J.DepP
                                0 - learn
                              * 1 - parse
                                2 - both
                                3 - cache
  -e, --encoding=TYPE         select encoding of input
                              * 0 - UTF-8
                                1 - EUC-JP
  -i, --ignore=STR            ignore input line starting with STR
  -c, --corpus=FILE           training corpus in JDEPP format ('train.JDP')
  -m, --model-dir=DIR         model directory ('/Users/ynaga/local/lib/jdepp/model/kyoto')
  -p, --parser=TYPE           select parsing algorithm
                              * 0 - shift reduce
                                1 - cascaded chunking
                                2 - backward
                                3 - tournament
  -I, --input-format=TYPE     select type of input format
                              * 0 - POS-tagged sentences
                                1 - + BUNSETSU annotation
                                2 - + DEPENDENCY annotation

Optional parameters in training:
  -l, --learner=TYPE          select type of learning library
                              * 0 - OPAL
                                1 - SVM    (disabled)
                                2 - MaxEnt (disabled)
  -n, --max-sent=INT          max. # processing sentences (0: all)

Misc.:
  -v, --verbose=INT           verbosity level (0)
  -h, --help                  show this help and exit
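
The usage line above separates the four option groups with `--`. As a minimal sketch (the helper `build_jdepp_argv` is hypothetical and not part of J.DepP), the following Python function assembles such a command line; it assumes only the group order shown in the usage line and drops trailing empty groups.

```python
from typing import List, Optional, Sequence


def build_jdepp_argv(
    options: Sequence[str] = (),
    learner: Optional[Sequence[str]] = None,
    chunker_classifier: Optional[Sequence[str]] = None,
    parser_classifier: Optional[Sequence[str]] = None,
    program: str = "jdepp",
) -> List[str]:
    """Join option groups in the order given by `jdepp -h`."""
    groups = [list(g or []) for g in (learner, chunker_classifier, parser_classifier)]
    # Drop trailing empty groups so that no dangling "--" is emitted.
    while groups and not groups[-1]:
        groups.pop()
    argv = [program, *options]
    for group in groups:
        argv.append("--")
        argv.extend(group)
    return argv


# Learner options only.
train_argv = build_jdepp_argv(["-t", "0", "-c", "train.JDP", "-I", "2"],
                              learner=["-t", "1", "-d", "2"])
print(" ".join(train_argv))
# Parser classifier options only: the two empty groups keep their "--".
parse_argv = build_jdepp_argv(parser_classifier=["-t", "1", "-s", "0.015"])
print(" ".join(parse_argv))
```

The resulting list can be handed to `subprocess.run` with the test file connected to standard input.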

Training

Type make model to build a model for J.DepP; by modifying configuration parameters, you can build various models from KyotoCorpus4.0 or KNB corpus.

# with Kyoto-University and NTT Blog corpus (will be automatically downloaded before training a model)
## training a parser with auto POSs given by MeCab/jumandic (this is the default parser configuration)
> configure (--with-corpus=knbc) (--with-postagger=mecab) (--with-mecab-dict=JUMAN)
## build a parser with auto POSs given by MeCab/NAIST-jdic
> configure (--with-corpus=knbc) (--with-postagger=mecab) --with-mecab-dict=NAIST-J
## build a parser with auto POSs given by MeCab/ipadic
> configure (--with-corpus=knbc) (--with-postagger=mecab) --with-mecab-dict=IPA
## build a standalone parser with a model trained using KNB corpus (requires MeCab)
> configure (--with-corpus=knbc) --enable-standalone

# with Kyoto University Text Corpus (put KyotoCorpus4.0 in the top of J.DepP source directory)
## training a parser with gold POSs (to measure the parsing accuracy)
> configure --with-corpus=kyoto --disable-autopos-train
## training a parser with auto POSs given by JUMAN
> configure --with-corpus=kyoto --with-postagger=juman

# parenthesized configuration options are defaults (so you can omit them)

Alternatively, you can train a parser with your own corpus in the following way.

# prepare the training data in the JDEPP format (morphological analyzer output + dependency annotation)
# you can convert training data in the KyotoCorpus format into JDEPP format as follows
# to train a parser compatible with JUMAN
> cat train.KNP | \
awk '!/^(#|\*|E)/ {$0 = $1" "$2" "($3 == "*" ? $1 : $3)" "$4" 0 "$5" 0 "$6" 0 "$7" 0 NIL"}; 1' > train.JDP
# to train a parser compatible with MeCab/jumandic
> cat train.KNP | \
awk '!/^(#|\*|E)/ {$0 = $1"\t"$4","$5","$6","$7","($3 == "*" ? $1 : $3)","$2",*"}; 1' > train.JDP
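
For readers less familiar with awk, the MeCab/jumandic conversion above can be mirrored in Python. This is a sketch based on a literal reading of the awk one-liner: the field names (surface, reading, base form, POS, and so on) are assumptions about the KyotoCorpus columns, `knp_to_mecab_jumandic` is a hypothetical helper rather than a script shipped with J.DepP, and the sample lines are made up.

```python
def knp_to_mecab_jumandic(line: str) -> str:
    """Convert one KyotoCorpus-format line; other lines pass through."""
    line = line.rstrip("\n")
    # Lines starting with "#", "*" or "E" (comments, bunsetsu headers, EOS)
    # are kept unchanged, exactly as the awk pattern !/^(#|\*|E)/ does.
    if line.startswith(("#", "*", "E")):
        return line
    f = line.split()
    f += [""] * (7 - len(f))  # awk treats missing fields as empty strings
    surface, reading, base, pos, pos1, ctype, cform = f[:7]
    if base == "*":           # no base form given: fall back to the surface
        base = surface
    return f"{surface}\t{pos},{pos1},{ctype},{cform},{base},{reading},*"


# Made-up sample input for illustration only.
sample = ["# S-ID:1", "* 0 1D", "太郎 たろう * 名詞 人名 * *", "EOS"]
converted = [knp_to_mecab_jumandic(raw) for raw in sample]
for out in converted:
    print(out)
```

Like the awk command, the sketch passes through any line that begins with `E`, not only `EOS`.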

# You may want to train J.DepP with auto POSs given by the front-end POS tagger
# to avoid the accuracy drop due to POS inconsistency; the training with auto POSs
# builds a more accurate parser than the training with gold POSs.
# JUMAN
> replace_pos.py juman -b < train.KNP > train.JDP
# MeCab/jumandic
> replace_pos.py mecab -d MECAB_DIC_DIR < train.KNP > train.JDP
# MeCab/ipadic or MeCab/naist-jdic
> replace_pos.py mecab -d MECAB_DIC_DIR < train.KNP > train.JDP

# create a directory to save a model
> mkdir model

# train chunker/parser with opal [-l 0], TinySVM [-l 1], or Tsuruoka's MaxEnt [-l 2]
# you can configure [learner options] delegated to the learner
# to see the learner's options, pass -h as a learner option
> jdepp -t 0 -c train.JDP -I 1 < test.JDP # chunker
> jdepp -t 0 -c train.JDP -I 2 < test.JDP # parser

# typical model hyper-parameters for a chunker
# PA without kernel; extremely fast but less accurate
> jdepp -t 0 -c train.JDP -I 1 -- -t 0 -c 0.05 -i 40
# default parameters; reasonably fast and accurate enough (recommended)
> jdepp -t 0 -c train.JDP -I 1 -- -t 1 -d 2 -c 0.00005 -i 40

# typical model hyper-parameters for training a dependency parser
#  pa_pl0; PA with linear kernel; extremely fast but less accurate
> jdepp -t 0 -c train.JDP -I 2 -- -t 0 -c 0.001 -i 40
#  pa_pl2; default parameters; reasonably fast and accurate enough (recommended)
> jdepp -t 0 -c train.JDP -I 2 -- -t 1 -d 2 -c 0.00005 -i 40
#  pa_pl3; PA1 with -d 3; slow but most accurate
> jdepp -t 0 -c train.JDP -I 2 -- -t 1 -d 3 -c 0.000001 -i 40

NOTE: The default parameters are tuned for training a shift-reduce parser with opal [-l 0 -p 0]; if you want to use the other parsing algorithms [-p 1|2|3] or estimators [-l 1|2], you should at least tune the regularization parameter [-c].

# build a feature sequence trie [to speed up a parser with d≥2 model]
#   note: you will gain a significant speed-up only with d>=3 models
# 1) apply J.DepP to a part of gigantic data (hopefully in the same domain you're going to analyze)
#    [the same format as train.JDP; no correct dependency annotation needed]
# 2) pass the data via [-c] to enumerate common feature sequences
#    (set [classifier option] accordingly)
# example (you can first consider the use of the training data):
> jdepp -t 3 -c train.JDP -I 1 -- -- -- -t 1 # (for chunker)
> jdepp -t 3 -c train.JDP -I 2 -- -- -- -t 1 -r 0.005 # (for parser)
> jdepp -t 3 -c train.JDP -I 2 -- -- -- -t 1 -s 0.015 # (for parser)

# you can configure [chunker|parser classifier options] to chunk|parse a sentence
# to see the classifier's options, pass -h as a classifier option
# example:
#  pa_pl0 (when you trained a chunker|parser using opal [-t 0],
#  opal is also used for classification)
> jdepp < test.sent
#  pa_pl2 (default PKE classifier; pecco [-t 1 -s 0.015])
> jdepp -- -- -- -t 1 -s 0.015 < test.sent
#  pa_pl2 (slightly slower SPLIT classifier)
> jdepp -- -- -- -t 1 -r 0.005 < test.sent
#  pa_pl2 (slightly faster FST classifier [-t 2])
> jdepp -- -- -- -t 2 -s 0.015 -i 8 < test.sent

# classifier options can be omitted in testing

Profiling

# script `to_tree.py' helps you understand J.DepP's machine-friendly parser output.
> jdepp < test.sent | to_tree.py

# `to_chunk.py' and `to_tree.py' compactly visualize the parser output
# when you input sentences w/ annotations [-I 1,2]:
> jdepp -I 1 -v -1 < dev.JDP | to_chunk.py -p | less -R
> jdepp -I 2 -v -1 < dev.JDP | to_tree.py  -p | less -R

# If you have an issue in rendering wide characters on Terminal of Mac OS X,
# try SIMBL plugin `TerminalEastAsianAmbiguousClearer'

NOTE: We highly recommend using the (default) passive-aggressive algorithm [-l 0] to train classifiers for parsers, since its training is an order of magnitude faster than SVM/MaxEnt and the accuracy of the resulting models is comparable to that of SVM.

Performance Comparison

The following table lists the statistics of the models referred to in the Usage section. The experiments were conducted on a MacBook Air (Mid 2011) running Mac OS X 10.7, with a 1.8 GHz Intel Core i7 CPU and 4 GB of main memory. Note that, for reference purposes, the parser is here configured with --disable-autopos-train (training with gold POSs is disabled by default because it is not appropriate when you run the parser with a POS tagger).

The parser accuracy is measured on the standard data set (Kyoto University Text Corpus version 4.0; training: 9501<01-11>.KNP and 95<01-08>ED.KNP, testing: 9501<14-17>.KNP and 95<10-12>ED.KNP) [1]. Testing shows throughput (the number of POS-tagged/bunsetsu-segmented sentences in Mainichi news articles (EUC-JP) parsed by J.DepP per second).

Model ID	Algorithm [-p]	opal option	pecco option	Dep. Acc. (%)	Sent. Acc. (%)	Training [s]	Testing [sent./s]
pa_pl0	linear [3]	-t 0 -c 1.0e-3 -i 40 -P	n/a	89.61	49.76	3.2	111385
pa_pl2	linear [3]	-t 1 -d 2 -c 5.0e-5 -i 40 -p	-t 1 -r 0.005 -i 7	92.12	57.87	35.0	33467
pa_pl2 (default)	linear [3]	-t 1 -d 2 -c 5.0e-5 -i 40 -p	-t 1 -s 0.015 -i 7	92.09	57.73	35.0	41289
pa_pl3	linear [3]	-t 1 -d 3 -c 1.0e-6 -i 40 -kp	-t 1 -r 0.05	92.29	58.80	259.8	3434
pa_pl3	linear [3]	-t 1 -d 3 -c 1.0e-6 -i 40 -kp	-t 1 -s 0.001	92.22	58.43	259.8	8051
pa_pl2 stacking KNP 4.16 with juman-7.0	linear [3]	-t 1 -d 2 -c 5.0e-5 -i 40 -p	-t 1 -s 0.015 -i 7	92.92	-	-	-
pa_pl3 stacking KNP 2.0 with juman-5.1	linear [3]	(sorry, I forgot exact ones)	(ditto)	93.19	-	-	-

You can further speed up a parser with a classifier (d≥3) by building a larger feature sequence trie from possible feature vectors generated by using the parser itself. The pecco paper might be helpful to tune hyper-parameters in training models with SVM/MaxEnt (they are trained with a smaller number of features, though).
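
The Dep. Acc. and Sent. Acc. columns in the table above can be read as per-dependency and per-sentence accuracy. The following Python sketch shows one common way to compute them; it assumes that the final chunk of each sentence (which has no head) is excluded from the dependency count, which may differ in detail from the evaluation used by J.DepP and [1]. The function name and the toy data are hypothetical.

```python
from typing import List, Tuple


def dependency_scores(gold: List[List[int]], pred: List[List[int]]) -> Tuple[float, float]:
    """Return (dependency accuracy, sentence accuracy) as fractions.

    Each sentence is a list of head indices, one per chunk; the final
    chunk has no head and is skipped (an assumption of this sketch).
    """
    n_dep = n_dep_ok = n_sent_ok = 0
    for g, p in zip(gold, pred):
        assert len(g) == len(p), "chunk counts must match"
        ok = sum(1 for gh, ph in zip(g[:-1], p[:-1]) if gh == ph)
        n_dep += len(g) - 1
        n_dep_ok += ok
        n_sent_ok += ok == len(g) - 1  # True counts as 1
    return n_dep_ok / n_dep, n_sent_ok / len(gold)


# Toy data: the second chunk of the first sentence gets a wrong head.
gold = [[1, 3, 3, -1], [2, 2, -1]]
pred = [[1, 2, 3, -1], [2, 2, -1]]
dep_acc, sent_acc = dependency_scores(gold, pred)
print(f"{dep_acc:.2f} {sent_acc:.2f}")  # 0.80 0.50
```

One wrong head lowers the dependency accuracy only slightly but makes the whole sentence count as incorrect, which is why Sent. Acc. is much lower than Dep. Acc. in the table.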

Disclaimer

We do not guarantee that the implemented algorithms other than those proposed by us are patent-free; we regard them as patent-free simply because their implementations are available as (existing) open-source software (or, otherwise, on the basis of a simple patent look-up). Please be careful when you use this software for commercial purposes.

If you train J.DepP with Kyoto University Text Corpus (--with-corpus=kyoto, --with-corpus=kyoto-partial), you can use the trained model only for research purposes (refer to http://www.nichigai.co.jp/sales/mainichi/mainichi-data.html, in Japanese).

How to pronounce `J.DepP'?

Read it as you like; hopefully you will enjoy spelling out the J.: wander around J for Johnny, Juggling, Jeering, JIT, or whatever, before settling for the most boring `Japanese'.

Acknowledgments

The developer thanks Prof. Daisuke Kawahara for his guidance in converting KNB corpus to Kyoto University Text Corpus format.

References

[1] K. Uchimoto, S. Sekine, and H. Isahara. Japanese Dependency Structure Analysis Based on Maximum Entropy Models. Proc. EACL, pp. 196--203. 1999.
[2] T. Kudo and Y. Matsumoto. Japanese Dependency Analysis using Cascaded Chunking. Proc. CoNLL, pp. 63--69. 2002.
[3] M. Sassano. Linear-Time Dependency Analysis for Japanese. Proc. COLING, pp. 8--14. 2004.
[4] M. Iwatate, M. Asahara, and Y. Matsumoto. Japanese Dependency Parsing Using a Tournament Model. Proc. COLING, pp. 361--368. 2008.
[5] Y. Tsuruoka, J. Tsujii, and S. Ananiadou. Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty. Proc. ACL-IJCNLP, pp. 477--485. 2009.
[6] Y. Freund and R. E. Schapire. Large Margin Classification using the Perceptron Algorithm. Machine Learning 37(3):277--296. 1999.
[7] K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, and Y. Singer. Online Passive-Aggressive Algorithms. JMLR 7(Mar):551--585. 2006.