The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing
Semantic Paths, a semantic search engine for ACL-IJCNLP 2021, is online which correlates papers, people, topics, areas and research bodies.
PROGRAM
Monday, August 2, 2021 (all times are UTC+0) |
|
08:15–08:35 |
Opening Session |
08:40–09:00 |
Presidential Address |
09:00–10:00 |
Keynote 1 : Advancing Technological Equity in Speech and Language Processing (Helen Meng) |
Session 1 |
Session 1A: Computational Social Science and Cultural Analytics 1 (Session Chair: Oren Tsur) |
10:00–10:10 |
Investigating label suggestions for opinion mining in German Covid-19 social media |
10:10–10:20 |
How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements |
10:20–10:30 |
Engage the Public: Poll Question Generation for Social Media Posts |
10:30–10:40 |
HateCheck: Functional Tests for Hate Speech Detection Models |
10:40–10:50 |
Unified Dual-view Cognitive Model for Interpretable Claim Verification |
10:50–10:57 |
Catchphrase: Automatic Detection of Cultural References |
Session 1B: Language Generation 1 (Session Chair: Nanyun (Violet) Peng) |
|
10:00–10:10 |
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling |
10:10–10:20 |
PENS: A Dataset and Generic Framework for Personalized News Headline Generation |
10:20–10:30 |
Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization |
10:30–10:40 |
Mention Flags (MF): Constraining Transformer-based Text Generators |
10:40–10:50 |
Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation |
10:50–10:57 |
On Training Instance Selection for Few-Shot Neural Text Generation |
Session 1C: Dialog and Interactive Systems 1 (Session Chair: Wei-Nan Zhang) |
|
10:00–10:10 |
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances |
10:10–10:20 |
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking |
10:20–10:30 |
Transferable Dialogue Systems and User Simulators |
10:30–10:40 |
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data |
10:40–10:50 |
GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling |
10:50–10:57 |
Coreference Resolution without Span Representations |
Session 1D: Information Extraction 1 (Session Chair: Alan Akbik) |
|
10:00–10:10 |
Accelerating BERT Inference for Sequence Labeling via Early-Exit |
10:10–10:20 |
Modularized Interaction Network for Named Entity Recognition |
10:20–10:30 |
Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder |
10:30–10:40 |
UniRE: A Unified Label Space for Entity Relation Extraction |
10:40–10:50 |
Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction |
10:50–10:57 |
Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition |
Session 1E: Machine Translation and Multilinguality 1 (Session Chair: Qun Liu) |
|
10:00–10:10 |
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation |
10:10–10:20 |
Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation |
10:20–10:30 |
Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation |
10:30–10:40 |
A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment |
10:40–10:50 |
Learning Language Specific Sub-network for Multilingual Machine Translation |
10:50–10:57 |
Difficulty-Aware Machine Translation Evaluation |
Session 2 |
Session 2A: Sentiment Analysis, Stylistic Analysis, and Argument Mining 1 (Session Chair: Yulan He) |
11:00–11:10 |
Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis |
11:10–11:20 |
Bridge-Based Active Domain Adaptation for Aspect Term Extraction |
11:20–11:30 |
Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks |
11:30–11:40 |
Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions |
11:40–11:47 |
Uncertainty and Surprisal Jointly Deliver the Punchline: Exploiting Incongruity-Based Features for Humor Recognition |
11:47–11:54 |
Counterfactuals to Control Latent Disentangled Text Representations for Style Transfer |
Session 2B: Summarization 1 (Session Chair: Min-Yen Kan) |
|
11:00–11:10 |
PASS: Perturb-and-Select Summarizer for Product Reviews |
11:10–11:20 |
Deep Differential Amplifier for Extractive Summarization |
11:20–11:30 |
Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by Generating Multiple Summaries |
11:30–11:40 |
Self-Supervised Multimodal Opinion Summarization |
11:40–11:50 |
A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance and Self-referenced Redundancy |
11:50–12:00 |
DESCGEN: A Distantly Supervised Datasetfor Generating Entity Descriptions |
Session 2C: Interpretability and Analysis of Models for NLP 1 (Session Chair: Yonatan Belinkov) |
|
11:00–11:10 |
Introducing Orthogonal Constraint in Structural Probes |
11:10–11:20 |
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger |
11:20–11:30 |
Examining the Inductive Bias of Neural Language Models with Artificial Languages |
11:30–11:40 |
Explaining Contextualization in Language Models using Visual Analytics |
11:40–11:50 |
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification |
11:50–11:57 |
Attention Flows are Shapley Value Explanations |
Session 2D: Language Grounding to Vision, Robotics and Beyond 1 (Session Chair: Quan Liu) |
|
11:00–11:10 |
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem |
11:10–11:20 |
E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning |
11:20–11:30 |
Learning Relation Alignment for Calibrated Cross-modal Retrieval |
11:30–11:40 |
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation |
11:40–11:47 |
Video Paragraph Captioning as a Text Summarization Task |
11:47–11:54 |
Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions |
Session 2E: Machine Learning for NLP 1 (Session Chair: Naoaki Okazaki) |
|
11:00–11:10 |
Cascaded Head-colliding Attention |
11:10–11:20 |
Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor |
11:20–11:30 |
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks |
11:30–11:40 |
COSY: COunterfactual SYntax for Cross-Lingual Understanding |
11:40–11:50 |
OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification |
11:50–11:57 |
How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? |
Session 3 |
Session 3A: Computational Social Science and Cultural Analytics 2 (Session Chair: Rob Voigt) |
14:00–14:10 |
Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model |
14:10–14:20 |
Structurizing Misinformation Stories via Rationalizing Fact-Checks |
14:20–14:30 |
Modeling Language Usage and Listener Engagement in Podcasts |
14:30–14:40 |
Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions |
14:40–14:50 |
SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues |
14:50–14:57 |
Automatic Fake News Detection: Are Models Learning to Reason? |
Session 3B: Dialog and Interactive Systems 2 (Session Chair: Ioannis Konstas) |
|
14:00–14:10 |
TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems |
14:10–14:20 |
Improving Dialog Systems for Negotiation with Personality Modeling |
14:20–14:30 |
Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training |
14:30–14:40 |
Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features |
14:40–14:47 |
Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries |
14:47–14:54 |
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses |
Session 3C: Information Extraction 2 (Session Chair: Parisa Kordjamshidi) |
|
14:00–14:10 |
CitationIE: Leveraging the Citation Graph for Scientific Information Extraction |
14:10–14:20 |
From Discourse to Narrative: Knowledge Projection for Event Relation Extraction |
14:20–14:30 |
AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER |
14:30–14:40 |
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge |
14:40–14:50 |
Discontinuous Named Entity Recognition as Maximal Clique Discovery |
14:50–15:00 |
LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking |
Session 3D: Machine Translation and Multilinguality 2 (Session Chair: Matthias Gallé) |
|
14:00–14:10 |
Do Context-Aware Translation Models Pay the Right Attention? |
14:10–14:20 |
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data |
14:20–14:30 |
Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment |
14:30–14:40 |
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models |
14:40–14:47 |
Gender bias amplification during Speed-Quality optimization in Neural Machine Translation |
14:47–14:54 |
Machine Translation into Low-resource Language Varieties |
Session 3E: Interpretability and Analysis of Models for NLP 2 (Session Chair: Sebastian Gehrmann) |
|
14:00–14:10 |
Learning Faithful Representations of Causal Graphs |
14:10–14:20 |
What Context Features Can Transformer Language Models Use? |
14:20–14:30 |
Integrated Directional Gradients: Feature Interaction Attribution for Neural NLP Models |
14:30–14:37 |
Is Sparse Attention more Interpretable? |
14:37–14:44 |
The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models |
14:44–14:51 |
Relative Importance in Sentence Processing |
Poster 1A: Semantics: Sentence-level Semantics, Textual Inference and Other areas |
|
15:00–17:00 |
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations |
15:00–17:00 |
Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models |
15:00–17:00 |
XLPT-AMR: Cross-Lingual Pre-Training via Multi-Task Learning for Zero-Shot AMR Parsing and Text Generation |
15:00–17:00 |
Span-based Semantic Parsing for Compositional Generalization |
15:00–17:00 |
AND does not mean OR: Using Formal Languages to Study Language Models’ Representations |
15:00–17:00 |
Enforcing Consistency in Weakly Supervised Semantic Parsing |
15:00–17:00 |
Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? |
Poster 1B: Linguistic Theories, Cognitive Modeling and Psycholinguistics |
|
15:00–17:00 |
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans |
Poster 1C: Semantics: Lexical Semantics |
|
15:00–17:00 |
The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing |
Poster 1D: Phonology, Morphology and Word Segmentation |
|
15:00–17:00 |
To POS Tag or Not to POS Tag: The Impact of POS Tags on Morphological Learning in Low-Resource Settings |
Poster 1E: Speech and Multimodality |
|
15:00–17:00 |
Prosodic segmentation for parsing spoken dialogue |
15:00–17:00 |
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation |
15:00–17:00 |
An Improved Model for Voicing Silent Speech |
Poster 1F: Ethics in NLP |
|
15:00–17:00 |
What’s in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus |
15:00–17:00 |
Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets |
Poster 1G: Information Retrieval and Text Mining |
|
15:00–17:00 |
Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network |
15:00–17:00 |
A DQN-based Approach to Finding Precise Evidences for Fact Verification |
Poster 1H: Machine Learning for NLP |
|
15:00–17:00 |
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing |
15:00–17:00 |
Unsupervised Out-of-Domain Detection via Pre-trained Transformers |
15:00–17:00 |
Continual Quality Estimation with Online Bayesian Meta-Learning |
15:00–17:00 |
MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation |
15:00–17:00 |
Selecting Informative Contexts Improves Language Model Fine-tuning |
15:00–17:00 |
Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification |
15:00–17:00 |
Multi-Task Retrieval for Knowledge-Intensive Tasks |
Poster 1I: Interpretability and Analysis of Models for NLP |
|
15:00–17:00 |
When Do You Need Billions of Words of Pretraining Data? |
15:00–17:00 |
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation |
15:00–17:00 |
Comparing Test Sets with Item Response Theory |
15:00–17:00 |
Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning |
15:00–17:00 |
More Identifiable yet Equally Performant Transformers for Text Classification |
Poster 1J: Dialog and Interactive Systems |
|
15:00–17:00 |
AugNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation |
15:00–17:00 |
A Span-based Dynamic Local Attention Model for Sequential Sentence Classification |
Poster 1K: Resources and Evaluation |
|
15:00–17:00 |
How effective is BERT without word ordering? Implications for language understanding and data privacy |
15:00–17:00 |
Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children’s mindreading ability |
15:00–17:00 |
A Dataset and Baselines for Multilingual Reply Suggestion |
15:00–17:00 |
WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation |
15:00–17:00 |
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? |
15:00–17:00 |
UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning |
15:00–17:00 |
Neural OCR Post-Hoc Correction of Historical Corpora |
Poster 1L: Computational Social Science and Cultural Analytics |
|
15:00–17:00 |
Align Voting Behavior with Public Statements for Legislator Representation Learning |
15:00–17:00 |
Measure and Evaluation of Semantic Divergence across Two Languages |
Poster 1M: Machine Translation and Multilinguality |
|
15:00–17:00 |
Improving Zero-Shot Translation by Disentangling Positional Information |
15:00–17:00 |
Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning |
15:00–17:00 |
Attention Calibration for Transformer in Neural Machine Translation |
15:00–17:00 |
Anchor-based Bilingual Word Embeddings for Low-Resource Languages |
15:00–17:00 |
Diverse Pretrained Context Encodings Improve Document Translation |
15:00–17:00 |
Multilingual Agreement for Multilingual Neural Machine Translation |
15:00–17:00 |
Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study |
Poster 1N: Syntax: Tagging, Chunking, and Parsing |
|
15:00–17:00 |
On Finding the K-best Non-projective Dependency Trees |
15:00–17:00 |
Higher-order Derivatives of Weighted Finite-state Machines |
Poster 1O: Theme |
|
15:00–17:00 |
Towards Argument Mining for Social Good: A Survey |
15:00–17:00 |
Automated Generation of Storytelling Vocabulary from Photographs for use in AAC |
Poster 1P: NLP Applications |
|
15:00–17:00 |
CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes |
15:00–17:00 |
Assessing Emoji Use in Modern Text Processing Tools |
15:00–17:00 |
Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention |
Poster 1Q: Language Generation |
|
15:00–17:00 |
Factorising Meaning and Form for Intent-Preserving Paraphrasing |
15:00–17:00 |
AggGen: Ordering and Aggregating while Generating |
15:00–17:00 |
Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models |
15:00–17:00 |
Towards Table-to-Text Generation with Numerical Reasoning |
15:00–17:00 |
Data-to-text Generation with Macro Planning |
Poster 1R: Summarization |
|
15:00–17:00 |
BACO: A Background Knowledge- and Content-Based Framework for Citing Sentence Generation |
15:00–17:00 |
Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization |
15:00–17:00 |
Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards |
Poster 1S: Question Answering |
|
15:00–17:00 |
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval |
15:00–17:00 |
A Semantics-aware Transformer Model of Relation Linking for Knowledge Base Question Answering |
15:00–17:00 |
A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding |
15:00–17:00 |
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation |
Poster 1T: Language Grounding to Vision, Robotics and Beyond |
|
15:00–17:00 |
Enhancing Descriptive Image Captioning with Natural Language Inference |
Poster 1U: Information Extraction |
|
15:00–17:00 |
Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification |
15:00–17:00 |
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition |
15:00–17:00 |
MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network |
15:00–17:00 |
Factuality Assessment as Modal Dependency Parsing |
Poster 1V: Sentiment Analysis, Stylistic Analysis, and Argument Mining |
|
15:00–17:00 |
Directed Acyclic Graph Network for Conversational Emotion Recognition |
15:00–17:00 |
Improving Formality Style Transfer with Context-Aware Rule Injection |
15:00–17:00 |
Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection |
15:00–17:00 |
Syntopical Graphs for Computational Argumentation Tasks |
15:00–17:00 |
Stance Detection in COVID-19 Tweets |
15:00–17:00 |
eMLM: A New Pre-training Objective for Emotion Related Tasks |
15:00–17:00 |
Topic-Aware Evidence Reasoning and Stance-Aware Aggregation for Fact Verification |
17:00—18:00 |
Keynote 2: Learning and Processing Language from Wearables: Opportunities and Challenges (Alejandrina Cristia) |
Session 4 |
Session 4A: Computational Social Science and Cultural Analytics 3 (Session Chair: Jing Li) |
23:00–23:10 |
Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset |
23:10–23:20 |
Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions |
23:20–23:30 |
A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies |
23:30–23:40 |
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection |
23:40–23:50 |
InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection |
23:50–23:57 |
On Positivity Bias in Negative Reviews |
Session 4B: Dialog and Interactive Systems 3 (Session Chair: Zhou Yu) |
|
23:00–23:10 |
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling |
23:10–23:20 |
A Sequence-to-Sequence Approach to Dialogue State Tracking |
23:20–23:30 |
Discovering Dialog Structure Graph for Coherent Dialog Generation |
23:30–23:40 |
Dialogue Response Selection with Hierarchical Curriculum Learning |
23:40–23:50 |
A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech |
23:50–23:57 |
PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation |
Session 4C: Information Extraction 3 (Session Chair: Wenhan Xiong) |
|
23:00–23:10 |
A Systematic Investigation of KB-Text Embedding Alignment at Scale |
23:10–23:20 |
Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data |
23:20–23:30 |
Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model |
23:30–23:40 |
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning |
23:40–23:47 |
ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction |
23:47–23:54 |
Zero-shot Event Extraction via Transfer Learning: Challenges and Insights |
Session 4D: Interpretability and Analysis of Models for NLP 3 (Session Chair: Niranjan Balasubramanian) |
|
23:00–23:10 |
Implicit Representations of Meaning in Neural Language Models |
23:10–23:20 |
Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models |
23:20–23:30 |
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach |
23:30–23:40 |
Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases |
23:40–23:50 |
Poisoning Knowledge Graph Embeddings via Relation Inference Patterns |
23:50–23:57 |
Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models |
Session 4E: Ethics in NLP 1 (Session Chair: Kai-Wei Chang) |
|
23:00–23:10 |
Bad Seeds: Evaluating Lexical Methods for Bias Measurement |
23:10–23:20 |
A Survey of Race, Racism, and Anti-Racism in NLP |
23:20–23:30 |
Intrinsic Bias Metrics Do Not Correlate with Application Bias |
23:30–23:40 |
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models |
23:40–23:47 |
Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing |
23:47–23:54 |
Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia |
Tuesday, August 3, 2021 (all times are UTC+0) |
|
Session 5 |
Session 5A: Machine Translation and Multilinguality 3 (Session Chair: Tong Xiao) |
00:00–00:10 |
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks |
00:10–00:20 |
Crafting Adversarial Examples for Neural Machine Translation |
00:20–00:30 |
UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource Cross-Lingual NLP |
00:30–00:40 |
Glancing Transformer for Non-Autoregressive Neural Machine Translation |
00:40–00:47 |
Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation |
00:47–00:54 |
Adaptive Nearest Neighbor Machine Translation |
Session 5B: Language Grounding to Vision, Robotics and Beyond 2 (Session Chair: Parisa Kordjamshidi) |
|
00:00–00:10 |
Hierarchical Context-aware Network for Dense Video Event Captioning |
00:10–00:20 |
Control Image Captioning Spatially and Temporally |
00:20–00:30 |
Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation |
00:30–00:40 |
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World |
00:40–00:50 |
Neural Event Semantics for Grounded Language Understanding |
Session 5C: Machine Learning for NLP 2 (Session Chair: Lili Mou) |
|
00:00–00:10 |
Modeling Fine-Grained Entity Types with Box Embeddings |
00:10–00:20 |
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information |
00:20–00:30 |
Weight Distillation: Transferring the Knowledge in Neural Network Parameters |
00:30–00:40 |
Optimizing Deeper Transformers on Small Datasets |
00:40–00:50 |
BERTAC: Enhancing Transformer-based Language Models with Adversarially Pretrained Convolutional Neural Networks |
00:50–00:57 |
On Orthogonality Constraints for Transformers |
Session 5D: NLP Applications 1 and Ethics (Session Chair: Vinodkumar Prabhakaran) |
|
00:00–00:10 |
COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic |
00:10–00:20 |
Explaining Relationships Between Scientific Documents |
00:20–00:30 |
IrEne: Interpretable Energy Prediction for Transformers |
00:30–00:40 |
Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach |
00:40–00:50 |
PlotCoder: Hierarchical Decoding for Synthesizing Visualization Code in Programmatic Context |
00:50–01:00 |
Changing the World by Changing the Data |
Session 6 |
Session 6A: Machine Learning for NLP 3 (Session Chair: Iz Beltagy) |
01:00–01:10 |
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets |
01:10–01:20 |
On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation |
01:20–01:30 |
Data Augmentation for Text Generation Without Any Augmented Data |
01:30–01:40 |
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation |
01:40–01:50 |
Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval |
01:50–01:57 |
Measuring and Improving BERT’s Mathematical Abilities by Predicting the Order of Reasoning. |
Session 6B: Resources and Evaluation 1 (Session Chair: Jackie CK Cheung) |
|
01:00–01:10 |
SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis |
01:10–01:20 |
KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers |
01:20–01:30 |
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus |
01:30–01:40 |
An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models |
01:40–01:50 |
Better than Average: Paired Evaluation of NLP systems |
01:50–01:57 |
Happy Dance, Slow Clap: Using Reaction GIFs to Predict Induced Affect on Twitter |
Session 6C: Semantics: Sentence-level Semantics, Textual Inference and Other areas 1 (Session Chair: Elior Sulem) |
|
01:00–01:10 |
Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL |
01:10–01:20 |
CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding |
01:20–01:30 |
Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference |
01:30–01:40 |
ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning |
01:40–01:50 |
Infusing Finetuning with Semantic Dependencies |
01:50–01:57 |
Exploring Listwise Evidence Reasoning with T5 for Fact Verification |
Session 6D: Sentiment Analysis, Stylistic Analysis, and Argument Mining 2 (Session Chair: Kentaro Inui) |
|
01:00–01:10 |
Distributed Representations of Emotion Categories in Emotion Space |
01:10–01:20 |
Style is NOT a single variable: Case Studies for Cross-Stylistic Language Understanding |
01:20–01:30 |
DynaSent: A Dynamic Benchmark for Sentiment Analysis |
01:30–01:40 |
A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow |
01:40–01:50 |
A Unified Generative Framework for Aspect-based Sentiment Analysis |
01:50–02:00 |
Classifying Argumentative Relations Using Logical Mechanisms and Argumentation Schemes |
Session 7 |
Session 7A: Dialog and Interactive Systems 4 (Session Chair: Yun-Nung Chen) |
08:00–08:10 |
Discovering Dialogue Slots with Weak Supervision |
08:10–08:20 |
Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU |
08:20–08:30 |
ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing |
08:30–08:40 |
Robustness Testing of Language Understanding in Task-Oriented Dialog |
08:40–08:50 |
Comprehensive Study: How the Context Information of Different Granularity Affects Dialogue State Tracking? |
08:50–09:00 |
OTTers: One-turn Topic Transitions for Open-Domain Dialogue |
Session 7B: Semantics: Sentence-level Semantics, Textual Inference and Other areas 2 (Session Chair: Nafise Sadat Moosavi) |
|
08:00–08:10 |
Towards Robustness of Text-to-SQL Models against Synonym Substitution |
08:10–08:20 |
KACE: Generating Knowledge Aware Contrastive Explanations for Natural Language Inference |
08:20–08:30 |
Self-Guided Contrastive Learning for BERT Sentence Representations |
08:30–08:40 |
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations |
08:40–08:47 |
DefSent: Sentence Embeddings using Definition Sentences |
08:47–08:54 |
Discrete Cosine Transform as Universal Sentence Encoder |
Session 7C: Speech and Multimodality 1 (Session Chair: Hung-yi Lee) |
|
08:00–08:10 |
Multi-stage Pre-training over Simplified Multimodal Pre-training Models |
08:10–08:20 |
Beyond Sentence-Level End-to-End Speech Translation: Context Helps |
08:20–08:30 |
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding |
08:30–08:40 |
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning |
08:40–08:50 |
Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities |
08:50–09:00 |
Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders |
Session 7D: Syntax: Tagging, Chunking, and Parsing 1 (Session Chair: Yan Song) |
|
08:00–08:10 |
N-ary Constituent Tree Parsing with Recursive Semi-Markov Model |
08:10–08:20 |
Automated Concatenation of Embeddings for Structured Prediction |
08:20–08:30 |
Multi-View Cross-Lingual Structured Prediction with Minimum Supervision |
08:30–08:40 |
The Limitations of Limited Context for Constituency Parsing |
08:40–08:50 |
Neural Bi-Lexicalized PCFG Induction |
Session 7E: Resources and Evaluation 2 (Session Chair: Jose Camacho-Collados) |
|
08:00–08:10 |
Ruddit: Norms of Offensiveness for English Reddit Comments |
08:10–08:20 |
Towards Quantifiable Dialogue Coherence Evaluation |
08:20–08:30 |
Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels |
08:30–08:40 |
Factoring Statutory Reasoning as Language Understanding Challenges |
08:40–08:50 |
Evaluating Evaluation Measures for Ordinal Classification and Ordinal Quantification |
08:50–08:57 |
AligNarr: Aligning Narratives on Movies |
Session 8 |
Session 8A: Information Extraction 4 (Session Chair: Danushka Bollegala) |
09:00–09:10 |
Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making |
09:10–09:20 |
Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition |
09:20–09:30 |
Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction |
09:30–09:40 |
A Large-Scale Chinese Multimodal NER Dataset with Speech Clues |
09:40–09:50 |
A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization |
09:50–10:00 |
OntoED: Low-resource Event Detection with Ontology Embedding |
Session 8B: Machine Translation and Multilinguality 4 (Session Chair: Tao Qin) |
|
09:00–09:10 |
Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation |
09:10–09:20 |
Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation with Cross-Task Pre-training |
09:20–09:30 |
Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation |
09:30–09:40 |
Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference? |
09:40–09:50 |
Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning |
09:50–09:57 |
An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers |
Session 8C: Machine Learning for NLP 4 (Session Chair: Matthias Gallé) |
|
09:00–09:10 |
Lightweight Cross-Lingual Sentence Representation Learning |
09:10–09:20 |
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer |
09:20–09:30 |
Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation |
09:30–09:40 |
Rational LAMOL: A Rationale-based Lifelong Learning Framework |
09:40–09:50 |
EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering |
09:50–10:00 |
LeeBERT: Learned Early Exit for BERT with cross-level optimization |
Session 8D: NLP Applications 2 (Session Chair: Preslav Nakov) |
|
09:00–09:10 |
Unsupervised Extractive Summarization-Based Representations for Accurate and Explainable Collaborative Filtering |
09:10–09:20 |
PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction |
09:20–09:30 |
Competence-based Multimodal Curriculum Learning for Medical Report Generation |
09:30–09:40 |
Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment |
09:40–09:50 |
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains |
09:50–09:57 |
Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models |
Session 8E: Question Answering 1 (Session Chair: Mrinmaya Sachan) |
|
09:00–09:10 |
A Semantic-based Method for Unsupervised Commonsense Question Answering |
09:10–09:20 |
Explanations for CommonsenseQA: New Dataset and Models |
09:20–09:30 |
Few-Shot Question Answering by Pretraining Span Selection |
09:30–09:40 |
UnitedQA: A Hybrid Approach for Open Domain Question Answering |
09:40–09:50 |
Database reasoning over text |
09:50–09:57 |
Training Adaptive Computation for Open-Domain Question Answering with Computational Constraints |
Session 9 |
Session 9A: Machine Translation and Multilinguality 5 (Session Chair: Lijun Wu) |
10:00–10:10 |
Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human Effort |
10:10–10:20 |
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models |
10:20–10:30 |
Evaluating morphological typology in zero-shot cross-lingual transfer |
10:30–10:40 |
From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text |
10:40–10:50 |
Fast and Accurate Neural Machine Translation with Translation Memory |
10:50–10:57 |
An Empirical Study on Adversarial Attack on NMT: Languages and Positions Matter |
Session 9B: Resources and Evaluation 3 (Session Chair: Margot Mieskes) |
|
10:00–10:10 |
Annotating Online Misogyny |
10:10–10:20 |
Few-NERD: A Few-shot Named Entity Recognition Dataset |
10:20–10:30 |
MultiMET: A Multimodal Dataset for Metaphor Understanding |
10:30–10:40 |
Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech |
10:40–10:47 |
OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres |
Session 9C: Question Answering 2 (Session Chair: Minjoon Seo) |
|
10:00–10:10 |
Can Generative Pre-trained Language Models Serve As Knowledge Bases for Closed-book QA? |
10:10–10:20 |
Joint Models for Answer Verification in Question Answering Systems |
10:20–10:30 |
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction |
10:30–10:40 |
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance |
10:40–10:50 |
Modeling Transitions of Focal Entities for Conversational Knowledge Base Question Answering |
10:50–10:57 |
In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering |
Session 9D: Semantics: Sentence-level Semantics, Textual Inference and Other areas 3 (Session Chair: Jacob Andreas) |
|
10:00–10:10 |
Evidence-based Factual Error Correction |
10:10–10:20 |
Probabilistic, Structure-Aware Algorithms for Improved Variety, Accuracy, and Coverage of AMR Alignments |
10:20–10:30 |
Meta-Learning to Compositionally Generalize |
10:30–10:40 |
Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation |
10:40–10:50 |
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning |
10:50–10:57 |
Zero-shot Fact Verification by Claim Generation |
Session 9E: Sentiment Analysis, Stylistic Analysis, and Argument Mining 3 (Session Chair: Sadao Kurohashi) |
|
10:00–10:10 |
Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction |
10:10–10:20 |
Every Bite Is an Experience: Key Point Analysis of Business Reviews |
10:20–10:30 |
Structured Sentiment Analysis as Dependency Graph Parsing |
10:30–10:37 |
Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer |
10:37–10:44 |
Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis |
10:44–10:51 |
Towards Generative Aspect-Based Sentiment Analysis |
Session 10 |
Session 10A: Machine Translation and Multilinguality 6 (Session Chair: Tong Xiao) |
11:00–11:10 |
Consistency Regularization for Cross-Lingual Fine-Tuning |
11:10–11:20 |
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment |
11:20–11:30 |
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation |
11:30–11:40 |
G-Transformer for Document-Level Machine Translation |
11:40–11:50 |
Prevent the Language Model from being Overconfident in Neural Machine Translation |
11:50–11:57 |
Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation |
Session 10B: Dialog and Interactive Systems 5 (Session Chair: Alborz Geramifard) |
|
11:00–11:10 |
Towards Emotional Support Dialog Systems |
11:10–11:20 |
Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System |
11:20–11:30 |
GTM: A Generative Triple-wise Model for Conversational Question Generation |
11:30–11:40 |
Diversifying Dialog Generation via Adaptive Label Smoothing |
11:40–11:50 |
Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training |
11:50–11:57 |
Continual Learning for Task-oriented Dialogue System with Iterative Network Pruning, Expanding and Masking |
Session 10C: Information Extraction 5 (Session Chair: Tristan Naumann) |
|
11:00–11:10 |
Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker |
11:10–11:20 |
Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path |
11:20–11:30 |
LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification |
11:30–11:40 |
Revisiting the Negative Data of Distantly Supervised Relation Extraction |
11:40–11:50 |
Knowing the No-match: Entity Alignment with Dangling Cases |
11:50–11:57 |
TIMERS: Document-level Temporal Relation Extraction |
Session 10D: Phonology, Morphology and Word Segmentation 1 (Session Chair: Xuanjing Huang) |
|
11:00–11:10 |
Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s Interpretation of Complex Words |
11:10–11:20 |
Optimizing over Subsequences Generates Context-Sensitive Languages |
11:20–11:30 |
Morphology Matters: A Multilingual Language Modeling Analysis |
11:30–11:37 |
Improving Arabic Diacritization with Regularized Decoding and Adversarial Training |
11:37–11:44 |
When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation |
11:44–11:51 |
More than Text: Multi-modal Chinese Word Segmentation |
Session 10E: Semantics: Lexical Semantics 1 (Session Chair: Danushka Bollegala) |
|
11:00–11:10 |
BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? |
11:10–11:20 |
Exploring the Representation of Word Meanings in Context: A Case Study on Homonymy and Synonymy |
11:20–11:30 |
Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach |
11:30–11:37 |
A Mixture-of-Experts Model for Antonym-Synonym Discrimination |
11:37–11:44 |
Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking |
11:44–11:51 |
A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space |
14:00–15:30 |
Business meeting and Green NLP panel |
15:30–16:30 |
Keynote 3: Reliable Characterizations of NLP Systems as a Social Responsibility (Christopher Potts) |
Session 11 |
Session 11A: Dialog and Interactive Systems 6 (Session Chair: Maryam Fazel-Zarandi) |
16:30–16:40 |
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations |
16:40–16:50 |
Value-Agnostic Conversational Semantic Parsing |
16:50–17:00 |
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding |
17:00–17:10 |
Best of Both Worlds: Making High Accuracy Non-incremental Transformer-based Disfluency Detection Incremental |
17:10–17:20 |
NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based Simulation |
17:20–17:27 |
Unsupervised Enrichment of Persona-grounded Dialog with Background Stories |
Session 11B: Linguistic Theories, Cognitive Modeling and Psycholinguistics 1 (Session Chair: Kyle Mahowald) |
|
16:30–16:40 |
CDRNN: Discovering Complex Dynamics in Human Language Processing |
16:40–16:50 |
Structural Guidance for Transformer Language Models |
16:50–17:00 |
Surprisal Estimators for Human Reading Times Need Character Models |
17:00–17:10 |
CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals |
17:10–17:20 |
Formal Basis of a Language Universal |
17:20–17:27 |
Beyond Laurel/Yanny: An Autoencoder-Enabled Search for Polyperceivable Audio |
Session 11C: Machine Learning for NLP 5 (Session Chair: Jacob Andreas) |
|
16:30–16:40 |
Self-Attention Networks Can Process Bounded Hierarchical Languages |
16:40–16:50 |
TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling |
16:50–17:00 |
H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences |
17:00–17:10 |
Making Pre-trained Language Models Better Few-shot Learners |
17:10–17:20 |
A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger’s Adversarial Attacks |
17:20–17:27 |
Don’t Let Discourse Confine Your Model: Sequence Perturbations for Improved Event Language Models |
Session 11D: Information Retrieval and Text Mining 1 (Session Chair: Thuy Vu) |
|
16:30–16:40 |
Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor Detection |
16:40–16:50 |
Label-Specific Dual Graph Neural Network for Multi-Label Text Classification |
16:50–17:00 |
TAN-NTM: Topic Attention Networks for Neural Topic Modeling |
17:00–17:10 |
Cross-language Sentence Selection via Data Augmentation and Rationale Training |
17:10–17:20 |
A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections |
17:20–17:27 |
The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes |
Session 11E: Discourse and Pragmatics 1 (Session Chair: Vera Demberg) |
|
16:30–16:40 |
W-RST: Towards a Weighted RST-style Discourse Framework |
16:40–16:50 |
ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences |
16:50–17:00 |
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering |
17:00–17:10 |
Adversarial Learning for Discourse Rhetorical Structure Parsing |
17:10–17:20 |
Exploring Discourse Structures for Argument Impact Classification |
Session 12 |
Session 12A: Machine Translation and Multilinguality 7 (Session Chair: Jiatao Gu) |
23:00–23:10 |
Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation |
23:10–23:20 |
VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation |
23:20–23:30 |
A unified approach to sentence segmentation of punctuated text in many languages |
23:30–23:40 |
Towards User-Driven Neural Machine Translation |
23:40–23:50 |
End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages |
23:50–23:57 |
Cross-lingual Text Classification with Heterogeneous Graph Neural Network |
Session 12B: Resources and Evaluation 4 (Session Chair: Gina-Anne Levow) |
|
23:00–23:10 |
Handling Extreme Class Imbalance in Technical Logbook Datasets |
23:10–23:20 |
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation |
23:20–23:30 |
Supporting Cognitive and Emotional Empathic Writing of Students |
23:30–23:40 |
Context-aware Adversarial Training for Name Regularity Bias in Named Entity Recognition |
23:40–23:50 |
SummEval: Re-evaluating Summarization Evaluation |
23:50–24:00 |
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary |
Session 12C: Question Answering 3 (Session Chair: Siddharth Patwardhan) |
|
23:00–23:10 |
Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering |
23:10–23:20 |
Generation-Augmented Retrieval for Open-Domain Question Answering |
23:20–23:30 |
Check It Again:Progressive Visual Question Answering via Visual Entailment |
23:30–23:40 |
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering |
23:40–23:50 |
Relevance-guided Supervision for OpenQA with ColBERT |
23:50–23:57 |
Towards more equitable question answering systems: How much more data do you need? |
Session 12D: Theme 1 (Session Chair: Diyi Yang) |
|
23:00–23:10 |
Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy? |
23:10–23:20 |
Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning |
23:20–23:30 |
Reliability Testing for Natural Language Processing Systems |
23:30–23:40 |
Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data |
23:40–23:50 |
Anonymisation Models for Text Data: State of the art, Challenges and Future Directions |
Wednesday, August 4, 2021 (all times UTC+0) |
|
Poster 2A: Semantics: Sentence-level Semantics, Textual Inference and Other areas |
|
0:00–2:00 |
End-to-End AMR Corefencence Resolution |
Poster 2B: Linguistic Theories, Cognitive Modeling and Psycholinguistics |
|
0:00–2:00 |
How is BERT surprised? Layerwise detection of linguistic anomalies |
0:00–2:00 |
Psycholinguistic Tripartite Graph Network for Personality Detection |
Poster 2C: Semantics: Lexical Semantics |
|
0:00–2:00 |
Verb Metaphor Detection via Contextual Relation Learning |
Poster 2D: Speech and Multimodality |
|
0:00–2:00 |
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task |
Poster 2E: Ethics in NLP |
|
0:00–2:00 |
Probing Toxic Content in Large Pre-Trained Language Models |
0:00–2:00 |
Societal Biases in Language Generation: Progress and Challenges |
Poster 2F: Interpretability and Analysis of Models for NLP |
|
0:00–2:00 |
Reservoir Transformers |
Poster 2G: Machine Learning for NLP |
|
0:00–2:00 |
Subsequence Based Deep Active Learning for Named Entity Recognition |
0:00–2:00 |
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models |
0:00–2:00 |
BinaryBERT: Pushing the Limit of BERT Quantization |
0:00–2:00 |
Embedding Time Differences in Context-sensitive Neural Networks for Learning Time to Event |
0:00–2:00 |
Are Pretrained Convolutions Better than Pretrained Transformers? |
0:00–2:00 |
PairRE: Knowledge Graph Embeddings via Paired Relation Vectors |
0:00–2:00 |
Improving Compositional Generalization in Classification Tasks via Structure Annotations |
0:00–2:00 |
Learning to Generate Task-Specific Adapters from Task Description |
0:00–2:00 |
Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification |
0:00–2:00 |
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability |
0:00–2:00 |
Efficient Content-Based Sparse Attention with Routing Transformers |
Poster 2H: Dialog and Interactive Systems |
|
0:00–2:00 |
Neural Stylistic Response Generation with Disentangled Latent Variables |
0:00–2:00 |
Intent Classification and Slot Filling for Privacy Policies |
0:00–2:00 |
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems |
0:00–2:00 |
QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining |
0:00–2:00 |
Domain-Adaptive Pretraining Methods for Dialogue Understanding |
0:00–2:00 |
Semantic Representation for Dialogue Modeling |
0:00–2:00 |
A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations |
0:00–2:00 |
SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching |
Poster 2I: Information Retrieval and Text Mining |
|
0:00–2:00 |
Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks |
0:00–2:00 |
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP |
Poster 2J: Resources and Evaluation |
|
0:00–2:00 |
Targeting the Benchmark: On Methodology in Current Natural Language Processing Research |
0:00–2:00 |
Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards? |
Poster 2K: Computational Social Science and Cultural Analytics |
|
0:00–2:00 |
Claim Matching Beyond English to Scale Global Fact-Checking |
0:00–2:00 |
X-Fact: A New Benchmark Dataset for Multilingual Fact Checking |
Poster 2L: Machine Translation and Multilinguality |
|
0:00–2:00 |
SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation |
0:00–2:00 |
Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models |
0:00–2:00 |
nmT5 - Is parallel data still relevant for pre-training massively multilingual language models? |
0:00–2:00 |
Syntax-augmented Multilingual BERT for Cross-lingual Transfer |
0:00–2:00 |
How to Adapt Your Pretrained Multilingual Model to 1600 Languages |
0:00–2:00 |
Synthesizing Parallel Data of User-Generated Texts with Zero-Shot Neural Machine Translation |
Poster 2M: Syntax: Tagging, Chunking, and Parsing |
|
0:00–2:00 |
Weakly Supervised Named Entity Tagging with Learnable Logical Rules |
Poster 2N: NLP Applications |
|
0:00–2:00 |
Question Generation for Adaptive Education |
Poster 2O: Language Generation |
|
0:00–2:00 |
Prefix-Tuning: Optimizing Continuous Prompts for Generation |
0:00–2:00 |
One2Set: Generating Diverse Keyphrases as a Set |
0:00–2:00 |
A Simple Recipe for Multilingual Grammatical Error Correction |
0:00–2:00 |
Continuous Language Generative Flow |
0:00–2:00 |
RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases |
Poster 2P: Summarization |
|
0:00–2:00 |
TWAG: A Topic-Guided Wikipedia Abstract Generator |
Poster 2Q: Question Answering |
|
0:00–2:00 |
Towards Visual Question Answering on Pathology Images |
0:00–2:00 |
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data |
0:00–2:00 |
Recursive Tree-Structured Self-Attention for Answer Sentence Selection |
Poster 2R: Language Grounding to Vision, Robotics and Beyond |
|
0:00–2:00 |
Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations |
0:00–2:00 |
mTVR: Multilingual Moment Retrieval in Videos |
Poster 2S: Information Extraction |
|
0:00–2:00 |
How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction |
0:00–2:00 |
Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction |
0:00–2:00 |
Element Intervention for Open Relation Extraction |
0:00–2:00 |
Explicitly Capturing Relations between Entity Mentions via Graph Neural Networks for Domain-specific Named Entity Recognition |
0:00–2:00 |
AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding |
0:00–2:00 |
CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction |
0:00–2:00 |
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference |
0:00–2:00 |
Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs |
Poster 2T: Sentiment Analysis, Stylistic Analysis, and Argument Mining |
|
0:00–2:00 |
Employing Argumentation Knowledge Graphs for Neural Argument Generation |
0:00–2:00 |
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction |
Session 13 |
Session 13A: Machine Translation and Multilinguality 8 (Session Chair: Longyue Wang) |
08:00–08:10 |
On Compositional Generalization of Neural Machine Translation |
08:10–08:20 |
Mask-Align: Self-Supervised Neural Word Alignment |
08:20–08:30 |
GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation |
08:30–08:37 |
Improving Lexically Constrained Neural Machine Translation with Source-Conditioned Masked Span Prediction |
Session 13B: Information Extraction 6 (Session Chair: Florian Boudin) |
|
08:00–08:10 |
De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention |
08:10–08:20 |
A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition |
08:20–08:30 |
MLBiNet: A Cross-Sentence Collective Event Detection Network |
08:30–08:40 |
Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution |
08:40–08:50 |
StereoRel: Relational Triple Extraction from a Stereoscopic Perspective |
08:50–09:00 |
Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks |
Session 13C: Machine Learning for NLP 6 (Session Chair: Kang Liu) |
|
08:00–08:10 |
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution |
08:10–08:20 |
Parameter-Efficient Transfer Learning with Diff Pruning |
08:20–08:30 |
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling |
08:30–08:40 |
Risk Minimization for Zero-shot Sequence Labeling |
08:40–08:50 |
WARP: Word-level Adversarial ReProgramming |
08:50–09:00 |
Lexicon Learning for Few Shot Sequence Modeling |
Session 13D: NLP Applications 3 (Session Chair: Juntao Li) |
|
08:00–08:10 |
Personalized Transformer for Explainable Recommendation |
08:10–08:20 |
Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques |
08:20–08:30 |
Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction |
08:30–08:40 |
Early Detection of Sexual Predators in Chats |
08:40–08:50 |
Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation |
08:50–08:57 |
Quotation Recommendation and Interpretation Based on Transformation from Queries to Quotations |
Session 13E: Information Retrieval and Text Mining 2 (Session Chair: Sarvnaz Karimi) |
|
08:00–08:10 |
Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification |
08:10–08:20 |
VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words |
08:20–08:30 |
Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision |
08:30–08:40 |
Semi-Supervised Text Classification with Balanced Deep Representation Distributions |
08:40–08:50 |
Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval |
08:50–08:57 |
Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence |
Poster 3A: Semantics: Sentence-level Semantics, Textual Inference and Other areas |
|
9:00–11:00 |
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer |
9:00–11:00 |
Exploring Dynamic Selection of Branch Expansion Orders for Code Generation |
9:00–11:00 |
COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion |
9:00–11:00 |
Reasoning over Entity-Action-Location Graph for Procedural Text Understanding |
9:00–11:00 |
From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding |
9:00–11:00 |
Pre-training Universal Language Representation |
9:00–11:00 |
Structural Pre-training for Dialogue Comprehension |
9:00–11:00 |
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models |
9:00–11:00 |
Data Augmentation with Adversarial Training for Cross-Lingual NLI |
9:00–11:00 |
Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese |
9:00–11:00 |
Code Generation from Natural Language with Less Prior Knowledge and More Monolingual Data |
9:00–11:00 |
Bootstrapped Unsupervised Sentence Representation Learning |
9:00–11:00 |
Learning Event Graph Knowledge for Abductive Reasoning |
9:00–11:00 |
Issues with Entailment-based Zero-shot Text Classification |
9:00–11:00 |
Neural-Symbolic Commonsense Reasoner with Relation Predictors |
Poster 3B: Linguistic Theories, Cognitive Modeling and Psycholinguistics |
|
9:00–11:00 |
A Cognitive Regularizer for Language Modeling |
9:00–11:00 |
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts |
9:00–11:00 |
Lower Perplexity is Not Always Human-Like |
Poster 3C: Semantics: Lexical Semantics |
|
9:00–11:00 |
Word Sense Disambiguation: Towards Interactive Context Exploitation from Both Word and Sense Perspectives |
9:00–11:00 |
A Knowledge-Guided Framework for Frame Identification |
9:00–11:00 |
Obtaining Better Static Word Embeddings Using Contextual Embedding Models |
9:00–11:00 |
Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation |
9:00–11:00 |
LexFit: Lexical Fine-Tuning of Pretrained Language Models |
9:00–11:00 |
Semantic Frame Induction using Masked Word Embeddings and Two-Step Clustering |
9:00–11:00 |
Multi-SimLex: A Large-Scale Evaluation of Multilingual and Cross-Lingual Lexical Semantic Similarity |
Poster 3D: Speech and Multimodality |
|
9:00–11:00 |
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units |
9:00–11:00 |
CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion Network |
9:00–11:00 |
Lightweight Adapter Tuning for Multilingual Speech Translation |
Poster 3E: Interpretability and Analysis of Models for NLP |
|
9:00–11:00 |
Parameter Selection: Why We Should Pay More Attention to It |
9:00–11:00 |
Positional Artefacts Propagate Through Masked Language Model Embeddings |
9:00–11:00 |
Language Model Evaluation Beyond Perplexity |
9:00–11:00 |
Learning to Explain: Generating Stable Explanations Fast |
9:00–11:00 |
StereoSet: Measuring stereotypical bias in pretrained language models |
9:00–11:00 |
Alignment Rationale for Natural Language Inference |
9:00–11:00 |
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators |
9:00–11:00 |
On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation |
9:00–11:00 |
CausaLM: Causal Model Explanation Through Counterfactual Language Models |
9:00–11:00 |
Amnesic Probing: Behavioral Explanation With Amnesic Counterfactuals |
Poster 3F: Information Retrieval and Text Mining |
|
9:00–11:00 |
Syntax-Enhanced Pre-trained Model |
9:00–11:00 |
Matching Distributions between Model and Data: Cross-domain Knowledge Distillation for Unsupervised Domain Adaptation |
9:00–11:00 |
Counterfactual Inference for Text Classification Debiasing |
9:00–11:00 |
HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation |
9:00–11:00 |
Distinct Label Representations for Few-Shot Text Classification |
9:00–11:00 |
PP-Rec: News Recommendation with Personalized User Interest and Time-aware News Popularity |
9:00–11:00 |
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims |
9:00–11:00 |
Learning to Solve NLP Tasks in an Incremental Number of Languages |
Poster 3G: Machine Learning for NLP |
|
9:00–11:00 |
Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble |
9:00–11:00 |
Shortformer: Better Language Modeling using Shorter Inputs |
9:00–11:00 |
BanditMTL: Bandit-based Multi-task Learning for Text Classification |
9:00–11:00 |
Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding |
9:00–11:00 |
Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling |
9:00–11:00 |
De-Confounded Variational Encoder-Decoder for Logical Table-to-Text Generation |
9:00–11:00 |
Rethinking Stealthiness of Backdoor Attack against NLP Models |
9:00–11:00 |
Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition |
9:00–11:00 |
Robust Transfer Learning with Pretrained Language Models through Adapters |
9:00–11:00 |
Embracing Ambiguity: Shifting the Training Target of NLI Models |
9:00–11:00 |
Exploring Distantly-Labeled Rationales in Neural Network Models |
9:00–11:00 |
Learning to Perturb Word Embeddings for Out-of-distribution QA |
Poster 3H: Dialog and Interactive Systems |
|
9:00–11:00 |
Maria: A Visual Experience Powered Conversational Agent |
9:00–11:00 |
A Human-machine Collaborative Framework for Evaluating Malevolence in Dialogues |
9:00–11:00 |
Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational AutoEncoders |
9:00–11:00 |
Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning |
9:00–11:00 |
Learning to Ask Conversational Questions by Optimizing Levenshtein Distance |
9:00–11:00 |
DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue |
9:00–11:00 |
Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking |
9:00–11:00 |
On the Generation of Medical Dialogs for COVID-19 |
9:00–11:00 |
Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images |
9:00–11:00 |
MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation |
9:00–11:00 |
DynaEval: Unifying Turn and Dialogue Level Evaluation |
9:00–11:00 |
Unsupervised Learning of KB Queries in Task-Oriented Dialogs |
Poster 3I: Ethics in NLP |
|
9:00–11:00 |
Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection |
Poster 3J: Resources and Evaluation |
|
9:00–11:00 |
CoSQA: 20,000+ Web Queries for Code Search and Question Answering |
9:00–11:00 |
QED: A Framework and Dataset for Explanations in Question Answering |
Poster 3K: Machine Translation and Multilinguality |
|
9:00–11:00 |
Rewriter-Evaluator Architecture for Neural Machine Translation |
9:00–11:00 |
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore |
9:00–11:00 |
Modeling Bilingual Conversational Characteristics for Neural Chat Translation |
9:00–11:00 |
Importance-based Neuron Allocation for Multilingual Neural Machine Translation |
9:00–11:00 |
Transfer Learning for Sequence Generation: from Single-source to Multi-source |
9:00–11:00 |
A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters |
Poster 3L: Discourse and Pragmatics |
|
9:00–11:00 |
Coreference Reasoning in Machine Reading Comprehension |
9:00–11:00 |
Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain |
9:00–11:00 |
Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing |
9:00–11:00 |
Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction |
Poster 3M: Syntax: Tagging, Chunking, and Parsing |
|
9:00–11:00 |
A Conditional Splitting Framework for Efficient Constituency Parsing |
9:00–11:00 |
A Unified Generative Framework for Various NER Subtasks |
9:00–11:00 |
An In-depth Study on Internal Structure of Chinese Words |
9:00–11:00 |
MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER |
9:00–11:00 |
Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter |
Poster 3N: NLP Applications |
|
9:00–11:00 |
Math Word Problem Solving with Explicit Numerical Values |
9:00–11:00 |
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks |
9:00–11:00 |
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining |
9:00–11:00 |
What is Your Article Based On? Inferring Fine-grained Provenance |
9:00–11:00 |
Cross-modal Memory Networks for Radiology Report Generation |
9:00–11:00 |
Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection |
9:00–11:00 |
Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews |
9:00–11:00 |
Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding |
9:00–11:00 |
Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism |
9:00–11:00 |
PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check |
Poster 3O: Language Generation |
|
9:00–11:00 |
Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting |
9:00–11:00 |
Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation |
9:00–11:00 |
POS-Constrained Parallel Decoding for Non-autoregressive Generation |
9:00–11:00 |
Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation |
9:00–11:00 |
TGEA: An Error-Annotated Dataset and Benchmark Tasks for TextGeneration from Pretrained Language Models |
9:00–11:00 |
Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction |
Poster 3P: Summarization |
|
9:00–11:00 |
Long-Span Summarization via Local Attention and Content Selection |
9:00–11:00 |
RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy |
9:00–11:00 |
BASS: Boosting Abstractive Summarization with Unified Semantic Graph |
9:00–11:00 |
Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation |
9:00–11:00 |
Focus Attention: Promoting Faithfulness and Diversity in Summarization |
9:00–11:00 |
Generating Query Focused Summaries from Query-Free Resources |
9:00–11:00 |
Demoting the Lead Bias in News Summarization via Alternating Adversarial Learning |
Poster 3Q: Question Answering |
|
9:00–11:00 |
DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications |
9:00–11:00 |
Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving |
9:00–11:00 |
Robustifying Multi-hop QA through Pseudo-Evidentiality Training |
9:00–11:00 |
Multi-Scale Progressive Attention Network for Video Question Answering |
9:00–11:00 |
Efficient Passage Retrieval with Hashing for Open-domain Question Answering |
9:00–11:00 |
xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering |
9:00–11:00 |
Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering |
Poster 3R: Language Grounding to Vision, Robotics and Beyond |
|
9:00–11:00 |
PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text Modeling |
9:00–11:00 |
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation |
9:00–11:00 |
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering |
9:00–11:00 |
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers |
Poster 3S: Information Extraction |
|
9:00–11:00 |
BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition |
9:00–11:00 |
CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction |
9:00–11:00 |
SENT: Sentence-level Distant Relation Extraction via Negative Training |
9:00–11:00 |
An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization |
9:00–11:00 |
PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction |
9:00–11:00 |
Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition |
9:00–11:00 |
Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference |
9:00–11:00 |
Entity Concept-enhanced Few-shot Relation Extraction |
9:00–11:00 |
Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation |
9:00–11:00 |
Unleash GPT-2 Power for Event Detection |
9:00–11:00 |
Improving Model Generalization: A Chinese Named Entity Recognition Case Study |
9:00–11:00 |
CLEVE: Contrastive Pre-training for Event Extraction |
9:00–11:00 |
Three Sentences Are All You Need: Local Path Enhanced Document Relation Extraction |
9:00–11:00 |
Document-level Event Extraction via Parallel Prediction Networks |
9:00–11:00 |
StructuralLM: Structural Pre-training for Form Understanding |
Poster 3T: Sentiment Analysis, Stylistic Analysis, and Argument Mining |
|
9:00–11:00 |
Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis |
9:00–11:00 |
Multi-Label Few-Shot Learning for Aspect Category Detection |
9:00–11:00 |
Argument Pair Extraction via Attention-guided Multi-Layer Multi-Cross Encoding |
9:00–11:00 |
A Neural Transition-based Model for Argumentation Mining |
11:00–12:00 |
Lifetime Award |
Session 14 |
Session 14A: Language Generation 2 (Session Chair: Shashi Narayan) |
14:00–14:10 |
Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text |
14:10–14:20 |
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence |
14:20–14:30 |
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics |
14:30–14:40 |
DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation |
14:40–14:50 |
Controllable Open-ended Question Generation with A New Question Type Ontology |
14:50–15:00 |
BERTGen: Multi-task Generation through BERT |
Session 14B: Machine Translation and Multilinguality 9 (Session Chair: Lemao Liu) |
|
14:00–14:10 |
Selective Knowledge Distillation for Neural Machine Translation |
14:10–14:20 |
Measuring and Increasing Context Usage in Context-Aware Machine Translation |
14:20–14:30 |
Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring |
14:30–14:40 |
CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web |
14:40–14:50 |
EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints |
14:50–15:00 |
Gender Bias in Machine Translation |
Session 14C: Machine Learning for NLP 7 (Session Chair: Danqi Chen) |
|
14:00–14:10 |
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search |
14:10–14:20 |
GhostBERT: Generate More Features with Cheap Operations for BERT |
14:20–14:30 |
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization |
14:30–14:40 |
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations |
14:40–14:50 |
Determinantal Beam Search |
14:50–15:00 |
Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning |
Session 14D: NLP Applications 4 (Session Chair: Emily Prud'hommeaux) |
|
14:00–14:10 |
Accelerating Text Communication via Abbreviated Sentence Input |
14:10–14:20 |
Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates |
14:20–14:30 |
Detecting Propaganda Techniques in Memes |
14:30–14:37 |
Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders |
14:37–14:44 |
Attentive Multiview Text Representation for Differential Diagnosis |
14:44–14:51 |
MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical Domain |
Session 14E: Question Answering 4 (Session Chair: Sara Rosenthal) |
|
14:00–14:10 |
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study |
14:10–14:20 |
Learning Dense Representations of Phrases at Scale |
14:20–14:30 |
End-to-End Training of Neural Retrievers for Open-Domain Question Answering |
14:30–14:40 |
Question Answering Over Temporal Knowledge Graphs |
14:40–14:47 |
Towards a more Robust Evaluation for Conversational Question Answering |
14:47–14:54 |
VAULT: VAriable Unified Long Text Representation for Machine Reading Comprehension |
Session 15 |
Session 15A: Language Generation 3 (Session Chair: Yangfeng Ji) |
15:00–15:10 |
Language Model Augmented Relevance Score |
15:10–15:20 |
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts |
15:20–15:30 |
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models |
15:30–15:40 |
Metaphor Generation with Conceptual Mappings |
15:40–15:50 |
Computational Framework for Slang Generation |
15:50–15:57 |
Avoiding Overlap in Data Augmentation for AMR-to-Text Generation |
Session 15B: NLP Applications 5 (Session Chair: Vincent Ng) |
|
15:00–15:10 |
Learning Latent Structures for Cross Action Phrase Relations in Wet Lab Protocols |
15:10–15:20 |
Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines |
15:20–15:30 |
Mid-Air Hand Gestures for Post-Editing of Machine Translation |
15:30–15:40 |
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning |
15:40–15:50 |
Joint Verification and Reranking for Open Fact Checking Over Tables |
15:50–15:57 |
Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains |
Session 15C: Resources and Evaluation 5 (Session Chair: Margot Mieskes) |
|
15:00–15:10 |
Evaluation of Thematic Coherence in Microblogs |
15:10–15:20 |
Neural semi-Markov CRF for Monolingual Word Alignment |
15:20–15:30 |
Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies |
15:30–15:40 |
The statistical advantage of automatic NLG metrics at the system level |
15:40–15:50 |
Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph Completion |
15:50–15:57 |
Can Transformer Models Measure Coherence In Text: Re-Thinking the Shuffle Test |
Session 15D: Summarization 2 (Session Chair: Fei Liu) |
|
15:00–15:10 |
ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining |
15:10–15:20 |
Improving Factual Consistency of Abstractive Summarization via Question Answering |
15:20–15:30 |
EmailSum: Abstractive Email Thread Summarization |
15:30–15:40 |
Cross-Lingual Abstractive Summarization with Limited Parallel Resources |
15:40–15:50 |
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution |
15:50–15:57 |
SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization |
Session 15E: Semantics: Lexical Semantics 2 (Session Chair: Frank Ferraro) |
|
15:00–15:10 |
Learning Prototypical Functions for Physical Artifacts |
15:10–15:20 |
Verb Knowledge Injection for Multilingual Event Processing |
15:20–15:30 |
Dynamic Contextualized Word Embeddings |
15:30–15:40 |
Lexical Semantic Change Discovery |
15:40–15:50 |
Analysis and Evaluation of Language Models for Word Sense Disambiguation |
15:50–16:00 |
Let’s Play mono-poly: BERT Can Reveal Words’ Degree of Polysemy |
Session 16 |
Session 16A: Dialog and Interactive Systems 7 (Session Chair: Alan Ritter) |
16:00–16:10 |
Pretraining the Noisy Channel Model for Task-Oriented Dialogue |
16:10–16:20 |
The R-U-A-Robot Dataset: Helping Avoid Chatbot Deception by Detecting User Questions About Human or Non-Human Identity |
16:20–16:30 |
Conversation Graph: Data Augmentation, Training and Evaluation for Non-Deterministic Dialogue Management |
16:30–16:40 |
Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems |
16:40–16:50 |
Space Efficient Context Encoding for Non-Task-Oriented Dialogue Generation with Graph Attention Transformer |
16:50–17:00 |
DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations |
Session 16B: Resources and Evaluation 6 (Session Chair: Bonnie Webber) |
|
16:00–16:10 |
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability |
16:10–16:20 |
TIMEDIAL: Temporal Commonsense Reasoning in Dialog |
16:20–16:30 |
RAW-C: Relatedness of Ambiguous Words in Context (A New Lexical Resource for English) |
16:30–16:40 |
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic |
16:40–16:47 |
SaRoCo: Detecting Satire in a Novel Romanian Corpus of News Articles |
16:47–16:54 |
Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents |
Session 16C: Semantics: Sentence-level Semantics, Textual Inference and Other areas 4 (Session Chair: Jonathan Berant) |
|
16:00–16:10 |
Improving Paraphrase Detection with the Adversarial Paraphrasing Task |
16:10–16:20 |
ADEPT: An Adjective-Dependent Plausibility Task |
16:20–16:30 |
ReadOnce Transformers: Reusable Representations of Text for Transformers |
16:30–16:40 |
Conditional Generation of Temporally-ordered Event Sequences |
16:40–16:50 |
Hate Speech Detection Based on Sentiment Knowledge Sharing |
Session 16D: Syntax: Tagging, Chunking, and Parsing 2 (Session Chair: Yannick Versley) |
|
16:00–16:10 |
Transition-based Bubble Parsing: Improvements on Coordination Structure Prediction |
16:10–16:20 |
SpanNER: Named Entity Re-/Recognition as Span Prediction |
16:20–16:30 |
Strong Equivalence of TAG and CCG |
16:30–16:40 |
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling |
16:40–16:47 |
Replicating and Extending “Because Their Treebanks Leak”: Graph Isomorphism, Covariants, and Parser Performance |
Session 16E: Machine Translation and Multilinguality 10 (Session Chair: Preslav Nakov) |
|
16:00–16:10 |
Language Embeddings for Typology and Cross-lingual Transfer Learning |
16:10–16:20 |
Can Sequence-to-Sequence Models Crack Substitution Ciphers? |
16:20–16:30 |
Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation |
16:30–16:40 |
Revisiting Negation in Neural Machine Translation |
16:40–16:50 |
Discriminative Reranking for Neural Machine Translation |
16:50–16:57 |
Don’t Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data |
Best Paper Session |
Best Paper Session |
23:00–23:03 |
Best Demo Paper |
23:03–23:16 |
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering |
23:16–23:29 |
All That’s ’Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text |
23:29–23:42 |
Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers |
23:42–23:55 |
Neural Machine Translation with Monolingual Translation Memory |
23:55–00:08 |
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning |
00:08–00:21 |
UnNatural Language Inference |
00:21–00:39 |
Including Signed Languages in Natural Language Processing |
00:39–00:57 |
Vocabulary Learning via Optimal Transport for Neural Machine Translation |
Thursday, August 5, 2021 (all times UTC+0) |
|
01:00–01:30 |
Distinguished Service and Test-Of-Time Awards session |
01:30–02:00 |
Closing and Future Conferences |