Knowledge Extraction from the Language Extended Lexicon Glossary Using Natural Language Processing

Keywords: Requirement specification, knowledge representation, natural language, extended lexicon, software development

Abstract

Successful software development requires good specifications, that is, correct, consistent, and unambiguous requirements. Natural language artifacts are the most used tools for writing specifications; nevertheless, the use of natural language can introduce many defects such as ambiguity, vagueness, and generality. In light of this, the purpose of this paper is to propose an approach for writing good specifications and extracting knowledge from them. The proposed approach uses a particular lexicon, the Language Extended Lexicon (LEL) glossary, and suggests rules for extracting knowledge (concepts, attributes, and relationships) from it. This paper also presents a prototype that implements and supports the proposed approach, as well as a preliminary evaluation of the approach. In conclusion, the proposed approach represents a theoretical contribution, which was evaluated using the System Usability Scale, and its evaluation results are promising, as it obtained the maximum score.

Author Biographies

Leandro Antonelli, Universidad Nacional de La Plata, Argentina

Universidad Nacional de La Plata, Buenos Aires-Argentina, leandro.antonelli@lifia.info.unlp.edu.ar

Mario Lezoche, Université de Lorraine, Francia

Université de Lorraine, Nancy-France, mario.lezoche@univ-lorraine.fr

Juliana Delle Ville, Universidad Nacional de La Plata, Argentina

Universidad Nacional de La Plata, Buenos Aires-Argentina, juliana.delleville@lifia.info.unlp.edu.ar

References

Boehm, “Software Engineering,” IEEE Transactions on Computers, vol. C-25, no. 12, pp. 1226-1241, Dec. 1976. https://doi.org/10.1109/TC.1976.1674590

S. L. Lim, and A. Finkelstein, “StakeRare: Using Social Networks and Collaborative Filtering for Large-Scale Requirements Elicitation,” IEEE Transactions on Software Engineering, vol. 38, no. 3, pp. 707-735, May-June 2012. https://doi.org/10.1109/TSE.2011.36

C. Potts, “Using schematic scenarios to understand user needs,” in Proc. of the confe. on Desig. Inter. Syst. processes, practices, methods, & techniques - DIS ’95, Association for Computing Machinery, New York, NY, USA, 1995, 247–256. https://doi.org/10.1145/225434.225462

D. M. Berry, E. Kamsties, and M. M. Krieger, From Contract Drafting to Software Specification: Linguistic Sources of Ambiguity, v1.0. (2003). Accessed: Sep. 25, 2023. [Online]. Available: https://cs.uwaterloo.ca/~dberry/handbook/ambiguityHandbook.pdf

K. Pohl, “The three dimensions of requirements engineering: A framework and its applications,” Inf. Syst., vol. 19, no. 3, pp. 243–258, Apr. 1994. https://doi.org/10.1016/0306-4379(94)90044-2

A. Hall, “Seven myths of formal methods,” IEEE Software, vol. 7, no. 5, pp. 11-19, Sep. 1990. https://doi.org/10.1109/52.57887

Hoare, “An overview of some formal methods for program design,” Computer (Long Beach Calif.), vol. 20, no. 9, pp. 85–91, 1987. https://doi.org/10.1109/MC.1987.1663697

L. M. Cysneiros, and J. C. Sampaio do Prado Leite, “Using the Language Extended Lexicon to Support Non-Functional Requirements Elicitation,” in Anais do WER01 - Workshop em Engenharia de Requisitos, Buenos Aires, Argentina, 2001. https://www.researchgate.net/publication/221235247_Using_the_Language_Extended_Lexicon_to_Support_Non-Functional_Requirements_Elicitation

Z. S. Harris, “Co-Occurrence and Transformation in Linguistic Structure,” Language, vol. 33, no. 3, p. 283, Apr. 1957. https://doi.org/10.2307/411155

N. Chomsky. “The Logical Structure of Linguistic Theory,” Plenum Press, p. 573, 1975. https://dingo.sbs.arizona.edu/~langendoen/ReviewOfChomskyLSLT.pdf

L. Antonelli, M. Lezoche, and J. Delle Ville, “A Method to obtain a Knowledge Representation from a Natural Language Specification of the Domain using the Glossary LEL,” Presented at the Decissioning 2023, Popayan, Jun. 2023. https://host170.sedici.unlp.edu.ar/server/api/core/bitstreams/41a01f6d-54e6-4039-9056-996a6b47f50a/content

J. C. S. do P. Leite, and A. P. M. Franco, “A strategy for conceptual model acquisition,” in [1993] Proceedings of the IEEE International Symposium on Requirements Engineering, San Diego, CA, USA, 1993, pp. 243-246. https://doi.org/10.1109/ISRE.1993.324851

A. V. Vu, and M. Ogawa, “Formal semantics extraction from natural language specifications for ARM,” in Formal Methods – The Next 30 Years, M. H. ter Beek, A. McIver, and J. Oliveira, Eds., Switzerland: Springer, Cham, 2019, pp. 465–483. https://doi.org/10.1007/978-3-030-30942-8_28

S. Geetha, and G.S.A. Mala, “Extraction of key attributes from natural language requirements specification text,” in IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), Chennai, India, 2013, p. 374-379. https://doi.org/10.1049/IC.2013.0341

F. Bargui, H. Ben-Abdallah, and J. Feki, “Multidimensional concept extraction and validation from OLAP requirements in NL,” in 2009 International Conference on Natural Language Processing and Knowledge Engineering, Dalian, China, 2009, pp. 1-8. https://doi.org/10.1109/NLPKE.2009.5313769

J. Kuchta, and P. Padhiyar, “Extracting Concepts from the Software Requirements Specification Using Natural Language Processing,” in 2018 11th International Conference on Human System Interaction (HSI), Gdansk, Poland, 2018, pp. 443-448. https://doi.org/10.1109/HSI.2018.8431221

Y. Rigou, D. Lamontagne, and I. Khriss, “A Sketch of a Deep Learning Approach for Discovering UML Class Diagrams from System’s Textual Specification,” in 2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), Meknes, Morocco, 2020, pp. 1-6. https://doi.org/10.1109/IRASET48871.2020.9092144

Y. Shen, and T. Breaux, “Domain Model Extraction from User-authored Scenarios and Word Embeddings,” in 2022 IEEE 30th International Requirements Engineering Conference Workshops (REW), Melbourne, Australia, 2022, pp. 143-151. https://doi.org/10.1109/REW56159.2022.00036

P. Szwed, “Concepts extraction from unstructured Polish texts: a rule based approach,” in Federated Conference on Computer Science and Information Systems, Lodz, Poland, 2015. https://doi.org/10.15439/2015F280

M. Li et al., “Automated Extraction of Requirement Entities by Leveraging LSTM-CRF and Transfer Learning,” in 2020 IEEE International Conference on Software Maintenance and Evolution (ICSME), Adelaide, SA, Australia, 2020, pp. 208-219. https://doi.org/10.1109/ICSME46990.2020.00029

Y. Wang, “Semantic information extraction for software requirements using semantic role labeling,” in 2015 IEEE International Conference on Progress in Informatics and Computing (PIC), Nanjing, China, 2015, pp. 332-337. https://doi.org/10.1109/PIC.2015.7489864

D. Sadoun, C. Dubois, Y. Ghamri-Doudane, and B. Grau, “From Natural Language Requirements to Formal Specification Using an Ontology,” in 2013 IEEE 25th International Conference on Tools with Artificial Intelligence, Herndon, VA, USA, 2013, pp. 755-760. https://doi.org/10.1109/ICTAI.2013.116

A. Schlutter, and A. Vogelsang, “Knowledge extraction from natural language requirements into a semantic relation graph,” in Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops, New York, NY, USA, 2020, pp. 373-379. https://doi.org/10.1145/3387940.3392162

Y. An et al., “Exploring Pre-Trained Language Models to Build Knowledge Graph for Metal-Organic Frameworks (MOFs),” in 2022 IEEE International Conference on Big Data (Big Data), Osaka, Japan, 2022, pp. 3651-3658. https://doi.org/10.1109/BigData55660.2022.10020568

C. Arora, M. Sabetzadeh, L. Briand, and F. Zimmer, “Extracting domain models from natural-language requirements: Approach and industrial evaluation”, in Proceedings of the ACM/IEEE 19th International Conference on Model Driven Engineering Languages and Systems, Saint-Malo, 2016, pp. 250–260. https://doi.org/10.1145/2976767.2976769

G. Van Rossum. Python. v3.12.3. Accessed: Mar. 05, 2023. [Online]. Available: https://www.python.org/

A. Holovaty, and S. Willison. Django. v5.0.4. Accessed: Mar. 05, 2023. [Online]. Available: https://www.djangoproject.com/

A. Ronacher. Flask, v3.0.3. Accessed: Mar. 05, 2023. [Online]. Available: https://flask.palletsprojects.com/

M. Honnibal, and I. Montani. Spacy. v3.0. Accessed: Mar. 05, 2023. [Online]. Available: https://spacy.io/

S. Bird, E. Klein, and E. Loper. NLTK. v3.11. Accessed: Mar. 05, 2023. [Online]. Available: https://www.nltk.org/

A. Roques, PlantUML, v1.2024.4. Accessed: Feb. 27, 2023. [Online]. Available: https://plantuml.com/

J. Brooke, “SUS-A quick and dirty usability scale,” in Usability evaluation in industry, 1st ed. United Kingdom, CRC Press, 1996. https://www.taylorfrancis.com/chapters/edit/10.1201/9781498710411-35/sus-quick-dirty-usability-scale-john-brooke

J. Brooke, “SUS: a retrospective,” Journal of user experience, vol. 8, no. 2, pp.29-40, 2013. https://uxpajournal.org/sus-a-retrospective/

A. Bangor, P. T. Kortum, and J. T. Miller, “An empirical evaluation of the system usability scale,” Int. J. Hum. Comput. Interact., vol. 24, no. 6, pp. 574–594, Jul. 2008. https://doi.org/10.1080/10447310802205776

J. Nielsen, “Estimating the number of subjects needed for a thinking aloud test,” Int. J. Hum. Comput. Stud., vol. 41, no. 3, pp. 385–397, Sep. 1994. https://doi.org/10.1006/ijhc.1994.1065

S. McLellan, A. Muddimer, and S. Camille Peres, “The effect of experience on System Usability Scale ratings,” Journal of user experience, vol. 7, no. 2, pp. 56-67, 2012. https://uxpajournal.org/the-effect-of-experience-on-system-usability-scale-ratings/

How to Cite
[1]
L. Antonelli, M. Lezoche, and J. Delle Ville, “Knowledge Extraction from the Language Extended Lexicon Glossary Using Natural Language Processing”, TecnoL., vol. 27, no. 59, p. e2917, Apr. 2024.

Downloads

Download data is not yet available.
Published
2024-04-29
Section
Research Papers

Altmetric

Crossref Cited-by logo