Publications

Links to papers and presentations produced by or relevant to the SAPIENTA project are listed below in the following categories (click on heading to jump to that section):

Journal Papers

Automatic recognition of conceptualisation zones in scientific articles and two life science applications


Maria Liakata, Shyamasree Saha, Simon Dobnik, Colin Batchelor and Dietrich Rebholz-Schuhmann
Bioinformatics 2012; doi: 10.1093/bioinformatics/bts071

A comparison and user-based evaluation of models of textual information structure in the context of cancer risk assessment.


Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Johan Hogberg and Ulla Stenius. 2011. BMC Bioinformatics , 12:69

Conference Papers

Multi-label Annotation in Scientific Articles – The Multi-label Cancer Risk Assessment Corpus


James Ravenscroft, Maria Liakata, Anika Oellrich, and Shyamasree Saha, LREC 2016.

Partridge: An Effective System for the Automatic Classification of the Types of Academic Papers


James Ravenscroft, Maria Liakata, Amanda Clare. AI-SGAI 2013. Research and Development in Intelligent Systems XXX pp 351-358

A discourse-driven content model for summarising scientific articles evaluated in a complex question answering task


Liakata, M., Dobnik, S., Saha, S., Batchelor, C. R., & Rebholz-Schuhmann, D. (2013). A Discourse-Driven Content Model for Summarising Scientific Articles Evaluated in a Complex Question Answering Task. In EMNLP (pp. 747-757).

Identifying the Information Structure of Scientific Abstracts: An Investigation of Three Different Schemes


Yufan Guo, Anna Korhonen, Maria Liakata, Ilona Silins, Lin Sun, Ulla Stenius. In Proceedings of the 2010 Workshop on Biomedical Natural Language Processing (July 2010), pp. 99-107.

Zones of conceptualisation in scientific papers: a window to negative and speculative statements


Maria Liakata. 2010. Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, July 2010, Uppsala, Sweden. 1-4. An overview of work on CoreSC automation and their usefulness in recognising negative and speculative statements.

Corpora for conceptualisation and zoning of scientific papers


Maria Liakata, Simone Teufel, Advaith Siddharthan and Colin Batchelor. 2010. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC’10), Malta.
The paper showcases and compares two schemes for sentence based annotation of full scientific papers, with discourse based information. The schemes are CoreSC and AZ-II.

Semantic Annotation of Papers: Interface & Enrichment Tool (SAPIENT)


Liakata M., Q Claire and Soldatova L. N. (2009) In Proceedings of BioNLP 2009, Boulder, Colorado, pp 193–200.
This paper introduces SAPIENT.

Presentations

SAPIENT Automation Project


Overview presentation to JISC, May 2010

Other Papers

Guidelines for the annotation of General Scientific Concepts.


JISC Project Report.(2008). Liakata, M. and Soldatova, L.