Abstract Meaning Representation (AMR) (Banarescu et al., 2013) is a framework for semantic dependencies that encodes its rooted and directed acyclic graphs in a format called PENMAN notation. The format is simple enough that users of AMR data often write small scripts or libraries for parsing it into an internal graph representation, but there is enough complexity that these users could benefit from a more sophisticated and well-tested solution. The open-source Python library Penman provides a robust parser, functions for graph inspection and manipulation, and functions for formatting graphs into PENMAN notation. Many functions are also available in a command-line tool, thus extending its utility to non-Python setups.
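The abstract notes that users of AMR data often write small ad-hoc parsers for PENMAN notation. A minimal sketch of such a parser in pure Python may make the format concrete; this is an illustrative toy, not the Penman library's implementation, and it ignores details (alignments, metadata lines, escaped quotes) that a robust parser must handle:

```python
import re

def parse_penman(s):
    """Parse a PENMAN string like '(b / bark-01 :ARG0 (d / dog))'
    into a list of (source, role, target) triples."""
    # Tokenize into parens, slashes, roles (:ARG0), quoted constants, and atoms.
    tokens = re.findall(r'[()/]|:[^\s()]+|"[^"]*"|[^\s()/]+', s)
    triples = []

    def parse_node(i):
        assert tokens[i] == '('          # every node opens with '('
        var = tokens[i + 1]              # the node's variable, e.g. 'b'
        i += 2
        if i < len(tokens) and tokens[i] == '/':
            # '/' introduces the node's concept, recorded as an :instance triple
            triples.append((var, ':instance', tokens[i + 1]))
            i += 2
        while tokens[i] != ')':
            role = tokens[i]             # e.g. ':ARG0'
            i += 1
            if tokens[i] == '(':
                target = tokens[i + 1]   # target is a nested node's variable
                i = parse_node(i)        # recurse; returns index past its ')'
            else:
                target = tokens[i]       # target is a variable or constant
                i += 1
            triples.append((var, role, target))
        return i + 1                     # skip the closing ')'

    parse_node(0)
    return triples
```

For example, `parse_penman('(b / bark-01 :ARG0 (d / dog))')` yields the triples `(b, :instance, bark-01)`, `(d, :instance, dog)`, and `(b, :ARG0, d)`. The Penman library itself exposes this functionality more robustly, e.g. via `penman.decode()`, which returns a `Graph` object with a `triples` attribute.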
In this paper we discuss the experience of bringing together over 40 different wordnets. We introduce some extensions to the GWA wordnet LMF format proposed in Vossen et al. (2016) and look at how this new information can be displayed. Notable extensions include: confidence, corpus frequency, orthographic variants, lexicalized and non-lexicalized synsets and lemmas, new parts of speech, and more. Many of these extensions already exist in multiple wordnets – the challenge was to find a compatible representation. To this end, we introduce a new version of the Open Multilingual Wordnet (Bond and Foster, 2013) that integrates a new set of tools to test the extensions introduced by this new format, while also ensuring the integrity of the Collaborative Interlingual Index (CILI; Bond et al., 2016) and preventing the same new concept from being introduced through multiple projects.
Abstract Meaning Representation (AMR; Banarescu et al., 2013) encodes the meaning of sentences as a directed graph, and Smatch (Cai and Knight, 2013) is the primary metric for evaluating AMR graphs. Smatch, however, is unaware of some meaning-equivalent variations in graph structure allowed by the AMR Specification and gives different scores for AMRs exhibiting these variations. In this paper I propose four normalization methods for helping to ensure that conceptually equivalent AMRs are evaluated as equivalent. Equivalent AMRs with and without normalization can look quite different: comparing a gold corpus to itself with relation reification alone yields a difference of 25 Smatch points, suggesting that the outputs of two systems may not be directly comparable without normalization. The algorithms described in this paper are implemented on top of an existing open-source Python toolkit for AMR and will be released under the same license.
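The paper's four normalization methods are not detailed in the abstract, but relation reification as defined in the AMR Specification illustrates the kind of meaning-equivalent variation at issue. For example, the `:location` role can be reified as the concept `be-located-at-91`, producing a structurally different but conceptually equivalent graph:

```
# unreified
(b / bomb
   :location (e / east))

# reified: equivalent meaning, different triples
(b / bomb
   :ARG1-of (l / be-located-at-91
               :ARG2 (e / east)))
```

Because Smatch compares triples directly, these two graphs score well below 100 against each other despite being equivalent, which is the mismatch the proposed normalizations address.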
We propose neural models to generate high-quality text from structured representations based on Minimal Recursion Semantics (MRS). MRS is a rich semantic representation that encodes more precise semantic detail than other representations such as Abstract Meaning Representation (AMR). We show that a sequence-to-sequence model that maps a linearization of Dependency MRS, a graph-based representation of MRS, to text can achieve a BLEU score of 66.11 when trained on gold data. The performance of the model can be improved further using a high-precision, broad coverage grammar-based parser to generate a large silver training corpus, achieving a final BLEU score of 77.17 on the full test set, and 83.37 on the subset of test data most closely matching the silver data domain. Our results suggest that MRS-based representations are a good choice for applications that need both structured semantics and the ability to produce natural language text as output.
Extracting semi-structured text from scientific writing in PDF files is a difficult task that has faced researchers for decades. In the 1990s, this task was largely a computer vision and OCR problem, as PDF files were often the result of scanning printed documents. Today, PDFs have standardized digital typesetting without the need for OCR, but extraction of semi-structured text from these documents remains a nontrivial task. In this paper, we present a system for the reanalysis of glyph-level PDF extracted text that performs block detection, respacing, and tabular data analysis for the purposes of linguistic data mining. We further present our reanalyzed output format, which attempts to eliminate the extreme verbosity of XML output while leaving important positional information available for downstream processes.
We describe resources aimed at increasing the usability of the semantic representations utilized within the DELPH-IN (Deep Linguistic Processing with HPSG) consortium. We concentrate in particular on the Dependency Minimal Recursion Semantics (DMRS) formalism, a graph-based representation designed for compositional semantic representation with deep grammars. Our main focus is on English, and specifically English Resource Semantics (ERS) as used in the English Resource Grammar. We first give an introduction to ERS and DMRS and a brief overview of some existing resources and then describe in detail a new repository which has been developed to simplify the use of ERS/DMRS. We explain a number of operations on DMRS graphs which our repository supports, with sketches of the algorithms, and illustrate how these operations can be exploited in application building. We believe that this work will aid researchers to exploit the rich and effective but complex DELPH-IN resources.
The majority of the world's languages have little to no NLP resources or tools. This is due to a lack of training data ("resources") over which tools, such as taggers or parsers, can be trained. In recent years, there have been increasing efforts to apply NLP methods to a much broader swath of the world's languages. In many cases this involves bootstrapping the learning process with enriched or partially enriched resources. We propose that Interlinear Glossed Text (IGT), a very common form of annotated data used in the field of linguistics, has great potential for bootstrapping NLP tools for resource-poor languages. Although IGT is generally very richly annotated, and can be enriched even further (e.g., through structural projection), much of the content is not easily consumable by machines since it remains "trapped" in linguistic scholarly documents and in human readable form. In this paper, we describe the expansion of the ODIN resource, a database containing many thousands of instances of IGT for over a thousand languages. We enrich the original IGT data by adding word alignment and syntactic structure. To make the data in ODIN more readily consumable by tool developers and NLP researchers, we adopt and extend a new XML format for IGT, called Xigt. We also develop two packages for manipulating IGT data: one, INTENT, enriches raw IGT automatically, and the other, XigtEdit, is a graphical IGT editor.
The current release of the ODIN (Online Database of Interlinear Text) database contains over 150,000 linguistic examples, from nearly 1,500 languages, extracted from PDFs found on the web, representing a significant source of data for language research, particularly for low-resource languages. Errors introduced during PDF-to-text conversion or poorly formatted examples can make the task of automatically analyzing the data more difficult, so we aim to clean and normalize the examples in order to maximize accuracy during analysis. In this paper we describe a system that allows users to automatically and manually correct errors in the source data in order to get the best possible analysis of the data. We also describe a RESTful service for managing collections of linguistic examples on the web. All software is distributed under an open-source license.
This is a report on the methods used and results obtained by the UW-Stanford team for the Automated Evaluation of Scientific Writing (AESW) Shared Task 2016 on grammatical error detection. This team developed a symbolic grammar-based system augmented with manually defined mal-rules to accommodate and identify instances of high-frequency grammatical errors. System results were entered both for the probabilistic estimation track, where we ranked second, and for the Boolean decision track, where we ranked fourth.
This paper presents Xigt, an extensible storage format for interlinear glossed text (IGT). We review design desiderata for such a format based on our own use cases as well as general best practices, and then explore existing representations of IGT through the lens of those desiderata. We give an overview of the data model and XML serialization of Xigt, and then describe its application to the use case of representing a large, noisy, heterogeneous set of IGT.
The majority of the world’s languages have little to no NLP resources or tools. This is due to a lack of training data (“resources”) over which tools, such as taggers or parsers, can be trained. In recent years, there have been increasing efforts to apply NLP methods to a much broader swathe of the world’s languages. In many cases this involves bootstrapping the learning process with enriched or partially enriched resources. One promising line of research involves the use of Interlinear Glossed Text (IGT), a very common form of annotated data used in the field of linguistics. Although IGT is generally very richly annotated, and can be enriched even further (e.g., through structural projection), much of the content is not easily consumable by machines since it remains “trapped” in linguistic scholarly documents and in human readable form. In this paper, we introduce several tools that make IGT more accessible and consumable by NLP researchers.
We present a case study of the methodology of using information extracted from interlinear glossed text (IGT) to create actual working HPSG grammar fragments using the Grammar Matrix, focusing on one language: Chintang. Though the results are barely measurable in terms of coverage over running text, they nonetheless provide a proof of concept. Our experience report reflects on the ways in which this task is non-trivial and on mismatches between the assumptions of the methodology and the realities of IGT as produced in a large-scale field project.
In this paper, we describe the expansion of the ODIN resource, a database containing many thousands of instances of Interlinear Glossed Text (IGT) for over a thousand languages. A database containing a large number of instances of IGT, which are effectively richly annotated and heuristically aligned bitexts, provides a unique resource for bootstrapping NLP tools for resource-poor languages. To make the data in ODIN more readily consumable by tool developers and NLP researchers, we propose a new XML format for IGT, called Xigt. We call the updated release ODIN-II.
We propose to bring together two kinds of linguistic resources—interlinear glossed text (IGT) and a language-independent precision grammar resource—to automatically create precision grammars in the context of language documentation. This paper takes the first steps in that direction by extracting major-constituent word order and case system properties from IGT for a diverse sample of languages.
This paper presents a new morphological framework for a grammar customization system. In this new system, the Lexical Integrity Principle is upheld with regards to morphological and syntactic processes, but the requirements for wordhood are expanded from a binary distinction to a set of constraints. These constraints are modeled with co-occurrence restrictions via flags set by lexical rules. Together with other new features, such as the ability to define disjunctive requirements and lexical rule hierarchies, these co-occurrence restrictions allow more complex, long-distance restrictions for phenomena such as bipartite stems, inclusive and exclusive OR-patterned restrictions on morpheme occurrence, constraints dependent on lexical type, and more. I show how the system is able to correctly handle patterns of French object clitics, Lushootseed tense and aspect markers, and Chintang bipartite stems.
Interlinear glossed text (IGT, the familiar three-line format of linguistic examples) can be an extremely rich source of linguistic information, when linguists follow best practices in creating it (e.g., the Leipzig glossing rules, Comrie et al. 2003). The ODIN project (http://www.csufresno.edu/odin; Lewis 2006) recognized the value of IGT data as a reusable data type and has created a searchable IGT database. This paper represents early efforts in a project to combine aggregations of IGT with a second source of linguistic knowledge to automatically produce implemented formal grammars. The second source of linguistic knowledge is the LinGO Grammar Matrix customization system (Bender et al. 2010). The Grammar Matrix is a multilingual grammar engineering project which includes a cross-linguistic core HPSG (Pollard and Sag 1994) grammar and a set of analyses for cross-linguistically variable phenomena which can be selected via a web-based questionnaire. As an initial pilot study, we focus on verb morphology (including morphotactics and the morphosyntactic effects of affixes) and we begin with a best-case scenario: For our IGT, we use the complete paradigm for the French verb faire (‘to do/make’) provided by Olivier Bonami (pc), including 15,658 phonologically transcribed, morphologically segmented and glossed verb forms.
This demonstration presents the LinGO Grammar Matrix grammar customization system: a repository of distilled linguistic knowledge and a web-based service which elicits a typological description of a language from the user and yields a customized grammar fragment ready for sustained development into a broad-coverage grammar. We describe the implementation of this repository with an emphasis on how the information is made available to users, including in-browser testing capabilities.
In this paper, we present refinements to the Grammar Matrix’s original morphotactic infrastructure (O’Hara, 2008), in order to better meet two constraints: (i) The system must be able to handle all types of morpheme co-occurrence restrictions found in the world’s languages; and (ii) the grammars it produces must be human- as well as machine-readable, i.e., suitable for extension and maintenance by grammar engineers.
We demonstrate that the bidirectionality of deep grammars, allowing them to generate as well as parse sentences, can be used to automatically and effectively identify errors in the grammars. The system is tested on two implemented HPSG grammars: Jacy for Japanese, and the ERG for English. Using this system, we were able to increase generation coverage in Jacy by 18% (45% to 63%) with only four weeks of grammar development.
The TaskTracer system allows knowledge workers to define a set of activities that characterize their desktop work. It then associates with each user-defined activity the set of resources that the user accesses when performing that activity. In order to correctly associate resources with activities and provide useful activity-related services to the user, the system needs to know the current activity of the user at all times. It is often convenient for the user to explicitly declare which activity he/she is working on. But frequently the user forgets to do this. TaskTracer applies machine learning methods to detect undeclared activity switches and predict the correct activity of the user. This paper presents TaskPredictor2, a complete redesign of the activity predictor in TaskTracer and its notification user interface. TaskPredictor2 applies a novel online learning algorithm that is able to incorporate a richer set of features than our previous predictors. We prove an error bound for the algorithm and present experimental results that show improved accuracy and a 180-fold speedup on real user data. The user interface supports negotiated interruption and makes it easy for the user to correct both the predicted time of the task switch and the predicted activity.