Simplifying Lexical Simplification: Do We Need Simplified Corpora?

Authors

Glavaš, Goran and Štajner, Sanja

Conference

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)

Year

2015

Figures & Tables

Table 1: Performance on the replacement task
Table 2: SemEval-2012 Task 1 performance
Table 3: Human evaluation results
Table 4: Example simplifications

Table of Contents

  • Abstract
  • 1 Introduction
  • 2 Related Work
  • 3 Resource-Light Lexical Simplification
    • 3.1 Simplification Candidate Selection
    • 3.2 Goodness-of-Simplification Features
    • 3.3 Simplification Algorithm
  • 4 Evaluation
    • 4.1 Replacement Task
    • 4.2 Ranking Task
    • 4.3 Human Evaluation
  • 5 Conclusion
  • Acknowledgements
  • References
