NELL’s subcategories from a question answering environment

  • Wesley W. O. Souza UFSCAR
  • Diorge Brognara UFSCAR
  • João A. Leite UFSCAR
  • Estevam R. Hruschka Jr. UFSCAR


With advances in machine learning, natural language processing, processing speed, and amount of data storage, conversational agents are being used in applications that were not possible to perform within a few years. NELL, a machine learning agent who learns to read the web, today has a considerably large ontology and while it can be used for multiple fact queries, it is also possible to expand it further and specialize its knowledge. One of the first steps to succeed is to refine existing knowledge in NELL’s knowledge base so that future communication between it and humans is as natural as possible. This work describes the results of an experiment where we investigate which machine learning algorithm performs best in the task of classifying candidate words to subcategories in the NELL knowledge base.


Bird, S., Klein, E., Loper, E., and Baldridge, J. (2008). Multidisciplinary instruction with the natural language toolkit. In Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics, pages 62–70. Association for Computational Linguistics.

Bradley, M. M. and Lang, P. J. (1999). Affective norms for english words (anew): Instruction manual and affective ratings. Technical report, Citeseer.

Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr, E. R., and Mitchell, T. M. (2010). Toward an architecture for never-ending language learning. In AAAI, volume 5, page 3.

Dalvi, B., Cohen, W. W., and Callan, J. (2013). Classifying entities into an incomplete ontology. In Proceedings of the 2013 workshop on Automated knowledge base construction, pages 31–36. ACM.

Dalvi, B., Minkov, E., Talukdar, P. P., and Cohen, W. W. (2015). Automatic gloss finding for a knowledge base using ontological constraints. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining.

Mohamed, T., Hruschka, E., and Mitchell, T. (2011). Discovering relations between noun categories. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1447–1455, Edinburgh, Scotland, UK. Association for Computational Linguistics.

Noy, N. F. and Klein, M. (2004). Ontology evolution: Not the same as schema evolution. Knowledge and information systems, 6(4):428–440.

Pedro, S. D., Appel, A. P., and Hruschka Jr, E. R. (2013). Autonomously reviewing and validating the knowledge base of a never-ending learning system. In Proceedings of the 22nd International Conference on World Wide Web, pages 1195–1204. ACM.

Pedro, S. D. and Hruschka Jr, E. R. (2012). Conversing learning: Active learning and active social interaction for human supervision in never-ending learning systems. In Ibero-American Conference on Artificial Intelligence, pages 231–240. Springer.

Settles, B. (2011). Closing the loop: Fast, interactive semi-supervised annotation with queries on features and instances. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1467–1478, Edinburgh, Scotland, UK. Association for Computational Linguistics.

Souza, W. W. O. and Hruschka Jr, E. R. (2016). Cognitive conversation language-ccl. In International Conference on Intelligent Systems Design and Applications, pages 309– 318. Springer.

Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., and Demirbas, M. (2010). Short text classification in twitter to improve information filtering. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 841–842. ACM.

Stojanovic, L. (2004). Methods and tools for ontology evolution.

Tausczik, Y. R. and Pennebaker, J.W. (2010). The psychological meaning of words: Liwc and computerized text analysis methods. Journal of language and social psychology, 29(1):24–54.

Yang, B. and Mitchell, T. (2016). Joint extraction of events and entities within a document context. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL).

Zablith, F., Antoniou, G., d’Aquin, M., Flouris, G., Kondylakis, H., Motta, E., Plexousakis, D., and Sabou, M. (2015). Ontology evolution: a process-centric survey. The knowledge engineering review, 30(1):45–75.

Zhang, X., Zhao, J., and LeCun, Y. (2015). Character-level convolutional networks for text classification. In Cortes, C., Lawrence, N. D., Lee, D. D., Sugiyama, M., and

Garnett, R., editors, Advances in Neural Information Processing Systems 28, pages 649–657. Curran Associates, Inc.

SOUZA, Wesley W. O.; BROGNARA, Diorge; LEITE, João A.; HRUSCHKA JR., Estevam R.. NELL’s subcategories from a question answering environment. In: ENCONTRO NACIONAL DE INTELIGÊNCIA ARTIFICIAL E COMPUTACIONAL (ENIAC), 15. , 2018, São Paulo. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2018 . p. 883-891. ISSN 2763-9061. DOI: