The Annotation data (corpus) for idiopathic pulmonary fibrosis (IPF) in brat format, which will be used for text-mining system


The network for the brat data will be shutdown for several minutes during the following periods:
from 16:30 J.S.T. November 22 to 15:00 J.S.T. November 25, 2024,
from 11:00 J.S.T. November 30 to 18:00 J.S.T. November 30, 2024,
from 16:00 J.S.T December 13 to 9:00 J.S.T. December 16, 2024,
from 3:00 J.S.T December 21 to 6:00 J.S.T December 21, 2024.
Sorry for inconvenience.

The links:

Annotation gudelines

Corpus, composed of 150 abstracts
IAA dataset

Config files for brat:
annotation config file for brat
visual config file for brat
tools config file for brat


Please cite the following paper:
Nagano, N., Tokunaga, N., Ikeda, M. et al. A novel corpus of molecular to higher-order events that facilitates the understanding of the pathogenic mechanisms of idiopathic pulmonary fibrosis. Sci Rep 13, 5986 (2023). doi: 10.1038/s41598-023-32915-8


Regarding the license, please see the following site:
License for the IPF corpus

The original brat tool is available at: brat rapid annotation tool (brat)
The latest brat tool is available at GitHub: brat rapid annotation tool (brat) in GitHub

With the brat tool and config files, the data files can be visualized more appropriately.
Moreover, this corpus for IPF was used to develop the following tools:
Tool of Named Entity Recognition for lung diseases
Tool of Entity Linking for lung diseases
Tool of Relation Extraction for lung diseases
Tool of Event Extraction for lung diseases
These tools may be useful for researchers and medical doctors for lung diseases.

This work was supported by the Public/Private R&D Investment Strategic Expansion PrograM (PRISM) in Japan.

Copyright: Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology(AIST)