吉永直樹 / Naoki Yoshinaga, Ph.D --- University of Tokyo

吉永直樹 / Naoki Yoshinaga, PhD

Associate Professor (Tenured) and The University of Tokyo Excellent Young Researcher

Institute of Industrial Science, The University of Tokyo
Department of Information and Communication Engineering, The University of Tokyo

Ee-503, 4-6-1 Komaba, Meguro-Ku, Tokyo, 153-8505, Japan
+81-3-5452-6239
ynaga (at) iis [guess from URL]

For prospective students: refer to Admission guide and Laboratory leaflet before contact; please send me an e-mail with subject of `On supervision' (otherwise, ignored).

Curriculum Vitae

2016 - present:	Associate Professor at Institute of Industrial Science, the University of Tokyo
2014 - 2016:	Senior Researcher at National Institute of Information and Communications Technology (NICT)
2012 - 2016:	Project Associate Professor at Institute of Industrial Science, the University of Tokyo
2008 - 2012:	Project Assistant Professor at Institute of Industrial Science, the University of Tokyo
2002 - 2008:	Research Fellow of the Japan Society for the Promotion of Science (JSPS) (DC1, PD) (11.6%, 10.0% accepted, respectively)
2002 - 2005:	Ph.D. in Department of Computer Science, Graduate School of Information Science and Technology, the University of Tokyo
2000 - 2002:	M.Sc. in Department of Information Science, Graduate School of Science, the University of Tokyo
1996 - 2000:	B.Sc. in Department of Information Science, Faculty of Science, the University of Tokyo

Research Interests

We're studying various aspects on natural language processing (NLP) and computational linguistics (CL), especially

Mechanistic interpretability [BlackBoxNLP-21, EACL-24, F. EMNLP-24, ACL-25]

A pragmatic (self-adaptive) model for NLP in the wild; beyond RAG [IJCNLP-13, ACL-13, ACL-17 SRW, NAACL-19, F. EMNLP-20, EACL-23]

Multilingual and Multimodal NLP [CoNLL-15, ACL-17, NAACL-19, CoNLL-19, F. EMNLP-20, NAACL-21, EACL-24, NAACL-24 SRW, ACL-25]

Evaluation metrics of language generation: [ACL-20 SRW, AACL-23 SRW]

A fast, compact, yet accurate model with algorithmic contributions [EMNLP-09, ACL-10, COLING-10, COLING-14, ACL-23, CoNLL-24]

Knowledge acquisition from social big data [EMNLP-12, IJCAI-16, IJCAI-19, F. EMNLP-21, ACL-23]

Past: information visualization for NLP [PacificVis-11, IUI-16, PacificVis-18]

I like to design important NLP tasks [EMNLP-12, ACL-13, COLING-14, IJCAI-16, IJCAI-19], rather than solving classic tasks on worn-out datasets.

Research Grants

Softwares for NLP/CL

Data Structures:

cedar: An efficient, updatable trie implementation based on double array.

Algorithms:

opal: A scalable kernel-based online learner (polynomial kernel is supported).
pecco: An efficient classifier for a model pre-trained with conjunctive features (or polynomial kernel).
yakmo: A robust, efficient alternative k-means clustering.

Systems:

Jagger: Super fast morphological analyzer for Japanese.
J.DepP: Very fast dependency parsers with the state-of-the-art accuracy for Japanese.
RenTAL (no longer maintained): A grammar compiler from Lexicalized TAG to HPSG-style grammar.

Note that some of the above softwares are no longer experimental codes; they are substantially elaborated from the original codes to have a better performance (2-5x speed-up, or +1% in accuracy etc.; track History section of each software). Those who want to reproduce the experimental results may want to use the oldest release of the softwares.

Selected Publications (with links to presentations, codes, and datasets)

Neuron Empirical Gradient: Discovering and Quantifying Neurons’ Global Linear Controllability (arXiv ver.)
ACL-25. Joint work with X. Zhao and Z. Jiang
TL;DR: We reveal a linear link between neuron activations and probabilities, introducing Neuron Empirical Gradient.

A-TASC: Asian TED-Based Automatic Subtitling Corpus
ACL-25. Joint work with Y. Zhou
TL;DR: We develop a large-scale corpus and metrics for automatic subtitling in Asian languages

Further Compressing Distilled Language Models via Frequency-aware Partial Sparse Coding of Embeddings
CoNLL-24. Joint work with K. Tamura and M. Neishi
TL;DR: We compress rare token embeddings using their nearest neighbor embeddings for common tokens as basis embeddings.

What Matters in Memorizing and Recalling Facts? Multifaceted Benchmarks for Knowledge Probing in Language Models --- data / slide / poster
Findings of EMNLP-24. Joint work with X. Zhao and D. Oba
TL;DR: We present BELIEF, a comprehensive knowledge probing benchmark for LLMs, using a newly-built diverse multi-prompt datasets, MyriadLAMA.

Commentary Generation from Data Records of Multiplayer Strategy Esports Game (journal ver.) --- data / poster
NAACL-24 SRW. Joint work with Zihan Wang
TL;DR: We set up a task (datasets) of generating commentaries from eSports data records.

Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge (arXiv ver.) --- code / silde
EACL-24 (long, acceptance rate 17.8%). Joint work with Xin Zhao and Daisuke Oba
TL;DR: We reveal knowledge representaions in multilingual language models.

Rethinking Response Evaluation from Interlocutor's Eye for Open-Domain Dialogue Systems --- data
IJCNLP-AACL-23 SRW. Joint work with Y. Tsuta, S. Sato, and M. Toyoda
TL;DR: We propose automatic personalized evaluation for dialogue systems in terms of engagement.

Back to Patterns: Efficient Japanese Morphological Analysis with Feature-Sequence Trie --- silde / poster / code
ACL-23 (short, acceptance rate 16.53%). single-authored work!
TL;DR: You know, patterns are just great.

Early Discovery of Disappearing Entities in Microblogs --- silde / poster / data
ACL-23 (long, acceptance rate 23.50%). Joint work with S. Akasaki and M. Toyoda
TL;DR: We set up and tackle a task of detecting disappearing entities from Twitter timelines.

A Unified Generative Approach to Product Attribute-Value Identification
Findings of ACL-23 (long). Joint work with Keiji Shinzato, Yandi Xia and Wei-Te Chen
TL;DR: You should use generation to solve product attribute-value identification, rather than extraction and classification.

Self-Adaptive Named Entity Recognition by Retrieving Unstructured Knowledge (journal ver.) --- poster
EACL-23 (long, acceptance rate 28.3%). Joint work with Kousuke Nishida and Kyousuke Nishida
TL;DR: We propose an NER model that autonomically search for knowledge required to recognize and type inconfident entities.

Entity Embedding Completion for Wide-Coverage Entity Disambiguation
Findings of EMNLP-22. Joint work with D. Oba, I. Yamada and M. Toyoda
TL;DR: We develop a lightweight method for adapting entity disambigation models to a new domain with out-of-vocabulary entities.

Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction --- slide / data
ACL-22 (short, acceptance rate 20.75%). Joint work with K. Shinzato, Y. Xia and W.-T. Chen
TL;DR: We show that internal knowledge in the training data improves the performance of QA-based attribute value extraction.

Exploratory Model Analysis Using Data-Driven Neuron Representations --- poster
BlackBoxNLP Workshop. Joint work with D. Oba and M. Toyoda
TL;DR: We establish a methodology for exploratory, hypothesis-free analysis of neural NLP models.

Fine-grained Typing of Emerging Entities in Microblogs --- code & data
Findings of EMNLP-21 (long, acceptance rate 34.9%). Joint work with S. Akasaki and M. Toyoda
TL;DR: We set up and tackle a task of typing emerging entities in microblogs.

Speculative Sampling in Variational Autoencoders for Dialogue Response Generation --- slide / code & data
Findings of EMNLP-21 (short, acceptance rate 34.9%). Joint work with S. Sato, M. Toyoda, and M. Kitsuregawa
TL;DR: We redundantly pick latent variables in the training of variational models to obtain better latent space.

Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model --- slide / poster / code & data
NAACL-21 (long, acceptance rate 28%). Joint work with A. Sugiyama
TL;DR: We utilize PMI computed by a document-level LM to perform context-aware decoding with a sentence-level NMT model.

Robust Backed-off Estimation of Out-of-Vocabulary Embeddings --- code
Findings of EMNLP-20 (long, acceptance rate 37%). Joint work with N. Fukuda and M. Kitsuregawa
TL;DR: Inspired by two processes of creating words, we propose a simple word-based method to estimate OOV embeddings.

Vocabulary Adaptation for Domain Adaptation in Neural Machine Translation --- code
Findings of EMNLP-20 (long, acceptance rate 37%). Joint work with S. Sato, J. Sakuma, M. Toyoda, M. Kitsuregawa
TL;DR: We transplant target-domain vocabularies to source-domain NMT model for effective fine tuning.

uBLEU: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems (journal ver.) --- slide / code
ACL-20 SRW (acceptance rate 36%). Joint work with Y. Tsuta and M. Toyoda
TL;DR: Our υBLEU performs retrieval-augmented evaluation for open-domain dialogues allowing diverse responses.

Data augmentation using back-translation for context-aware neural machine translation --- code
DiscoMT-19. Joint work with A. Sugiyama
TL;DR: We investigated the impact of data augmentation on context-aware neural MT.

Multilingual model using cross-task embedding projection --- slide / code
CoNLL-19 (oral, acceptance rate: 22%). Joint work with J. Sakuma
TL;DR: Our locally-linear mapping optimizes multilingual models based on cross-lingual word embeddings to any tasks.

On the Relation between Position Information and Sentence Length in Neural Machine Translation --- poster / code
CoNLL-19 (acceptance rate: 22%). Joint work with M. Neishi
TL;DR: We revealed that Transformer is bad at handling inputs of unseen lengths, and fixed it using RNN as relative position encoder.

Early Discovery of Emerging Entities in Microblogs --- slide / poster / data
IJCAI-19 (acceptance rate: 13.7%). Joint work with S. Akasaki and M. Toyoda
TL;DR: We set up and tackle a task of detecting emerging entities from Twitter timelines using timely distant supervision.

Learning to Describe Unknown Phrases with Local and Global Contexts --- slide / code & data
NAACL-19 (long, oral; acceptance rate 26%). Joint work with S. Ishiwatari, H. Hayashi, G. Neubig, S. Sato, M. Toyoda, and M. Kitsuregawa
TL;DR: Our LOG-CaD can explain unknown phrases; evaluated on new Wikipedia datasets.

Modeling Personal Biases in Language Use by Inducing Personalized Word Embeddings (journal ver.) --- slide
NAACL-19 (short, oral; acceptance rate 21%). Joint work with D. Oba, S. Sato, S, Akasaki, and M. Toyoda
TL;DR: We enable to model personalized usage of words, accompanied with analysis and applications.

Information Integrated Visualization System for Heavy Rainfall Risk Analysis
PacificViz-18 (poster). Joint work with M. Itoh, T. Sagara, U. Suzuki, K. Umemoto, M. Toyoda, K. Zettsu, and Y. Kidawara
TL;DR: We integrate and visualize social sensor data to analyze heavy rainfall risks.

A Bag of Useful Tricks for Practical Neural Machine Translation: Embedding Layer Initialization and Large Batch Size (journal ver.) --- poster / code
WAT-17 (oral). Joint work with M. Neishi, J. Sakuma, S. Tohda, S. Ishiwatari and M. Toyoda
TL;DR: We confirmed the effectiveness of CBoW-based embedding layer initialization and Large Batch Size in NMT training.

Modeling Situations in Neural Chat Bots --- poster
ACL-17 SRW (acceptance rate 36%). Joint work with S. Sato, M. Toyoda and M. Kitsuregawa
TL;DR: We enable to model and incorporate user profiles and time in open-domain dialogue systems.

Chunk-based Decoder for Neural Machine Translation --- poster
ACL-17 (long; acceptance rate 26%). Joint work with S. Ishiwatari, J. Yao, S. Liu, M. Li, M. Zhou, M. Kitsuregawa and W. Jia
Inspired by phrase-based SMT, we propose chunk-based decoding in NMT.

Ordering Concepts Based on Common Attribute Intensity --- slide / poster / code & data
IJCAI-16 (acceptance rate <25%). Joint work with T. Iwanari, N. Kaji, T. Nishina, M. Toyoda and M. Kitsuregawa; a follow-up paper at COLING-16 (Demo) --- poster / software & data
TL;DR: We set up and tackle a task of inducing your sense of values from your writings.

Spatio-temporal Event Visualization from a Geo-parsed Microblog Stream --- poster / demo by Prof. Itoh
IUI-16 (poster). Joint work with M. Itoh and M. Toyoda
TL;DR: We visualize spatio-temporal tweets on word-clouds in the sky.

Accurate Cross-lingual Projection between Count-based Word Vectors by Exploiting Translatable Context Pairs --- poster
CoNLL-15 (short, acceptance rate <30%). Joint work with S. Ishiwatari, N. Kaji, M. Toyoda and M. Kitsuregawa
TL;DR: We incorporate surface-based dimensional correspondences into count-based word vectors.

A Self-adaptive Classifier for Efficient Text-stream Processing --- poster / software
COLING-14 (acceptance rate 32%). Joint work with M. Kitsuregawa
TL;DR: We propose a method of accelerating NLP classifiers when the processed text becomes redundant.

Modeling User Leniency and Product Popularity for Sentiment Classification (journal ver.) --- poster
IJCNLP-13. Joint work with W. Gao, N. Kaji, and M. Kitsuregawa
TL;DR: We enable to model annotation and selection biases in sentiment analysis for unseen users and targets.

Predicting and Eliciting Addressee's Emotion in Online Dialogue --- poster
ACL-13 (long; acceptance rate 26%). Joint work with T. Hasegawa, N. Kaji, and M. Toyoda
TL;DR: We enable to model emotions in corpus-based open-dialogue systems.

Identifying Constant and Unique Relations by using Time-Series Text --- slide
EMNLP-12 (oral; acceptance rate 17%). Joint work with Y. Takaku, N. Kaji, and M. Toyoda
Use of massive Web in knowledge acquisition introduces enormous contradictions; solution provided.

Analysis and Visualization of Temporal Changes in Bloggers' Activities and Interests --- demo / demo by Prof. Itoh
PacificVis-12 (acceptance rate 34%). Joint work with M. Itoh, M. Toyoda, and M. Kitsuregawa
TL;DR: We visualize dependencies in weblogs to understand the state of the world.

Kernel Slicing: Scalable Online Training with Conjunctive Features --- slide / software & data
COLING-10 (oral; acceptance rate 19%). Joint work with M. Kitsuregawa
The kernel slicing generalizes kernel splitting (ACL 2008) to pack computations in online learning with polynomial kernel.

Efficient Staggered Decoding for Sequence Labeling --- software
ACL-10 (long; acceptance rate 25%). Joint work with N. Kaji, Y. Fujiwara, and M. Kitsuregawa
TL;DR: We made structured prediction scalable to the number of classes.

Polynomial to Linear: Efficient Classification with Conjunctive Features (journal ver.) --- poster / software & data
EMNLP-09 (acceptance rate 34%). Joint work with M. Kitsuregawa
TL;DR: We speed up testing of non-linear classifier with polynomial kernel.

Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia --- software
LREC-08. Joint work with A. Sumida and K. Torisawa
TL;DR: We developed a method of quickly obtaining a large-scale hyponymy relations from Wikipedia.

Open-Domain Attribute-Value Acquisition from Semi-Structured Texts --- demo
ISWC-07 workshop, OntoLex. Joint work with K. Torisawa; a follow-up paper on writing-support environment
TL;DR: We developed a method of extracting attributes and their values for a given object using unsupervised wrapper induction.

Improving the Accuracy of Subcategorizations Acquired from Corpora
ACL-04 SRW (acceptance rate 28%)

A Debug Tool for Practical Grammar Development
ACL-03 (poster; acceptance rate 43%). Joint work with A. Yakushiji, Y. Tateisi, Y. Miyao, and J. Tsujii

Comparison between CFG Filtering Techniques for LTAG and HPSG (journal ver.)
ACL-03 (poster; acceptance rate 43%). Joint work with Y. Miyao, K. Torisawa, and J. Tsujii

Grammar Conversion from LTAG to HPSG (journal ver.) --- software
ESSLLI 2011 Student Session (oral; acceptance rate 26%). Joint work with Y. Miyao.

Awards

Committee Special Award the 29th Annual Meeting of the Association for Natural Language Processing (NLP) (2023)
JSAI SIG Research Award 2022: Japanese Society for Artificial Intelligence, Special Interest Group on Fundamental Problems of Artificial Intelligence (SIG-SLUD) (2022)
Best Interactive Award (1st place): the eighth Forum on Data Engineering and Information Management (DEIM) (2019)
JSAI 30th Anniversary Best Paper Award (2016)
Best Poster Award: the 17th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing) (2016)
Best Interactive Award: the eighth Forum on Data Engineering and Information Management (DEIM) (2016)
Best Paper Award: the WebDB Forum (2015)
Business Award (Yahoo! Japan Award): the WebDB Forum (2014)
JSAI SIG Research Award 2013: Japanese Society for Artificial Intelligence, Special Interest Group on Fundamental Problems of Artificial Intelligence (SIG-FPAI) (2013)
Best Student Paper Award: IEICE Transactions on Information and Systems (2013).
Best Paper Award: the third Forum on Data Engineering and Information Management (DEIM) (2011).
Best Paper Award: the 72nd National Convention of IPSJ (2010).
Best Paper Award: Journal of Natural Language Processing (2009).
Business Award (1st place): the Symposium on DataBases and Web Information Systems (DBWeb) (2007)

Awards (as a supervisor):

Student Presentation Award: the seventh Forum on Data Engineering and Information Management (DEIM) (2023), for two papers.
Young Excellence Award: the 13th Dialogue System Symposium (2022).
Young Encouragement Award: the 253th SIG-NL symposium, Information Processing Society of Japan (2022).
Young Encouragement Award: the 252th SIG-NL symposium, Information Processing Society of Japan (2022).
Young Researcher Award: the 26th Annual Meeting of the Association for Natural Language Processing (NLP) (2022).
Young Encouragement Award: the 246th SIG-NL symposium, Information Processing Society of Japan (2020).
Young Researcher Award: the 26th Annual Meeting of the Association for Natural Language Processing (NLP) (2020).
Young Researcher Award: the 25th Annual Meeting of the Association for Natural Language Processing (NLP) (2019).
Student Presentation Award: the seventh Forum on Data Engineering and Information Management (DEIM) (2019), for two papers.
Student Presentation Award: the seventh Forum on Data Engineering and Information Management (DEIM) (2015).
Student Incentive Award: the WebDB Forum (2014)
Student Presentation Award: the sixth Forum on Data Engineering and Information Management (DEIM) (2014).
Student Incentive Award: the WebDB Forum (2013)
Young Researcher Award: the 19th Annual Meeting of the Association for Natural Language Processing (NLP) (2013).
Student Presentation Award: the fifth Forum on Data Engineering and Information Management (DEIM) (2013).
Best Student Paper Award: IEICE technical report on Natural Language Understanding and Models of Communication (2012).
Student Incentive Award: the fourth Forum on Data Engineering and Information Management (DEIM) (2012).

last-modified: Jun 28 21:10:59 2025, written by XHTML 1.1

吉永 直樹 / Naoki Yoshinaga, PhD