Using semantic search in MesH and PubChem databases for entity linking
SEEK ID: https://fairdomhub.org/assays/1488
Modelling analysis
Projects: BioCreative VII
Investigation: Chemical Identification and Indexing in PubMed articles
Study: Chemical named entity recognition and annotation
Assay position:
Biological problem addressed: Annotation
Organisms: No organisms
Export PNG
Views: 304
Created: 12th Oct 2021 at 17:16
Last updated: 27th Sep 2023 at 14:21
This item has not yet been tagged.
Related items
Projects: SysMO DB, FAIRDOM, ICYSB 2015 - International Practical Course in Systems Biology, ZucAt, SysMO-LAB, Kinetics on the move - Workshop 2016, Example use cases, FAIRDOM user meeting, ErasysApp Funders, EraCoBiotech 2 nd call proposal preparation, Service to URV Tarragona, Spain with respect to their Safety Assessment of Endocrine Disrupting Chemicals model (Active NOW), FAIRDOM & LiSyM & de.NBI Data Structuring Training, MESI-STRAT, INCOME, Multiscale modelling of state transitions in the host-microbiome-brain network, BESTER, TRALAMINOL, Sustainable co-production, INDIE - Biotechnological production of sustainable indole, Extremophiles metabolsim, PoLiMeR - Polymers in the Liver: Metabolism and Regulation, OLCIR: Optimization of Lung Cancer Therapy with Ionizing Radiation, NAD COMPARTMENTATION, HOTSOLUTE, Stress granules, FAIRDOM Community Workers, GMDS Project Group "FAIRe Dateninfrastrukturen für die Biomedizinische Informatik", Mechanism based modeling viral disease ( COVID-19 ) dynamics in human population, COVID-19 Disease Map, AquaHealth (ERA-BlueBio), LiSyM Core Infrastructure and Management (LiSyM-PD), Early Metabolic Injury (LiSyM-EMI - Pillar I), Regeneration and Repair in Acute-on-Chronic Liver Failure (LiSyM-ACLF - Pillar III), Chronic Liver Disease Progression (LiSyM-DP - Pillar II), Liver Function Diagnostics (LiSyM-LiFuDi - Pillar IV), The Hedgehog Signalling Pathway (LiSyM-JGMMS), Multi-Scale Models for Personalized Liver Function Tests (LiSyM-MM-PLF), Model Guided Pharmacotherapy In Chronic Liver Disease (LiSyM-MGP), Molecular Steatosis - Imaging & Modeling (LiSyM-MSIM), Modelling COVID-19 epidemics, SNAPPER: Synergistic Neurotoxicology APP for Environmental Regulation, SCyCode The Autotrophy-Heterotrophy Switch in Cyanobacteria: Coherent Decision-Making at Multiple Regulatory Layers, SASKit: Senescence-Associated Systems diagnostics Kit for cancer and stroke, CC-TOP, BioCreative VII, MESI-STRAT Review, SDBV/HITS, MESI-Review 2024
Institutions: Heidelberg Institute for Theoretical Studies (HITS gGmbH), FAIRDOM User meeting, Norwegian University of Science and Technology, University of Rostock, University of Innsbruck
https://orcid.org/0000-0003-3540-0402Expertise: Genetics, Molecular Biology, Bioinformatics, Data Management, Transcriptomics, semantics, Curation, Ontology, Data Modelling
Tools: Cell and tissue culture, Databases, Chip-chip, BioMart, Protege, RightField, SEEK
I am a researcher at the Scientific Databases and Visualization Group at Heidelberg Institute for Theoretical Studies (HITS) , one of the developers of SabioRK - System for the Analysis of Biochemical Pathways - Reaction Kinetics (http://sabiork.h-its.org/) . I am working on design and maintenance of the information systems to store, query and analyse systems biology data; definition and implementation of methods for the integration of data from multiple sources. In **[SySMO-DB ...
Projects: FAIRDOM, BioCreative VII, The BeeProject, SDBV/HITS, Semantic Table Interpretation in Chemistry
Institutions: Heidelberg Institute for Theoretical Studies (HITS gGmbH)
https://orcid.org/0000-0002-7585-4479Expertise: Data analysis, Computational Systems Biology, Databases, Data Management, Table Curation
Tools: Machine Learning, Python, Java, standards, Data Integration
Projects that do not fall under current programmes.
Projects: Manchester Institute for Biotechnology, ICYSB 2015 - International Practical Course in Systems Biology, iRhythmics, INBioPharm, EmPowerPutida, Systo models, MycoSynVac - Engineering Mycoplasma pneumoniae as a broad-spectrum animal vaccine, Multiscale modelling of state transitions in the host-microbiome-brain network, Extremophiles metabolsim, NAD COMPARTMENTATION, Agro-ecological modelling, Bergen(Ziegler lab) project AF-NADase, NAMPT affinity, Stress granules, Modelling COVID-19 epidemics, Bio-crop, ORHIZON, Coastal Data, SASKit: Senescence-Associated Systems diagnostics Kit for cancer and stroke, hybrid sequencing, HOST-PAR, BioCreative VII, Boolean modeling of Parkinson disease map, Orphan cytochrome P450 20a1 CRISPR/Cas9 mutants and neurobehavioral phenotypes in zebrafish, Selective Destruction in Ageing, Viral Metagenomic, Synthetic biology in Synechococcus for bioeconomy applications (SynEco), testproject, SDBV ephemeral data exchanges, Test project, The BeeProject, PHENET, LiceVault, EbN1 Systems Biology, UMRPégase, DeCipher, Heat stress response of the red-tide dinoflagellate Prorocentrum cordatum, middle ear, datamgmt, Institut Pasteur's projects, The nucleus of Prorocentrum cordatum, qpcr, MRC-UNICORN, Test project for Sciender, qPCR, Artificial organelles_Pathogen digestion, Supplementary Information 2 associated with the manuscript entitled " Label free Mass spectrometry proteomics reveals different pathways modulated in THP-1 cells infected with therapeutic failure and drug resistance Leishmania infantum clinical isolates", FAIR Functional Enrichment, PTPN11 mutagenesis, Supplementary Information 2 associated with the manuscript entitled "Label free Mass spectrometry proteomics reveals different pathways modulated in THP-1 cells infected with therapeutic failure and drug resistance Leishmania infantum clinical isolates", iPlacenta- Placenta on a chip, Near Surface Wave-Coherent Measurements of Temperature and Humidity, A Meta-Analysis of Functional Recovery of Aphasia after Stroke by Acupuncture Combined with Language Rehabilitation Training, Phytoplankton phenology in the Bay of Biscay: using remote sensing to assess and raise awareness of climate change impacts on the sea, Master-BIDS, Endometriosis, Vitis Data Crop, MESI-STRAT Review, Establishing an innovative and transnational feed production approach for reduced climate impact of the aquaculture sector and future food supply, ARAX: a web-based computational reasoning system for translational biomedicine, Adaptation of Salmonella enterica, I AM FRONTIER, ., PhD Nicotinic Acetylcholine Receptors, SFB1361 playground, Amaizing, Conspicuous chloroplast with LHC‒PSI/II‒megacomplex and diverse PBPs in the marine dinoflagellate Prorocentrum cordatum, icpm-kth, SDBV/HITS, sample project, TestingSeek, Genomic Medicine, Remodeling of cIV, Virtual Human Platform for Safety Assessment, PROMISEANG, URGI, Matsutake, UNDESIRABLE EFFECTS OF POST COVID-19 VACCINATION: A DESCRIPTIVE STUDY, WINTER 2022, Semantic Table Interpretation in Chemistry, MS identification of L infantum proteins related to their drug resistance patterns for new drug targets identification and ecotoxicological evaluations of their environmental and interspecies impact, the Supplementary materials for paper, ToxiGen - Reproductive toxicity and transgenerational effects of petroleum mixtures in fish, PhotoBoost, Measurement of Fisheries Provisioning Services and its Pressure to Support Sustainability of Fisheries in The Jatigede Reservoir, Indonesia, FIsh data on 2022 in the Jatigede Reservoir, ImmPort - data sharing, MESI-Review 2024, REWIRED: comparative RNA-seq and ATAC-seq in six salmonids and six outgroup telest fishes, REWIRED, Data Repository, APPN Test Project, Enhanced Anticancer Effect of Thymidylate Synthase Dimer Disrupters Promoting Intracellular Accumulation, BIDS, BioRECIPE representation format, UMass Chan BioImage DMS Core_FAIR Metadata Templates, Function, control and engineering of microbial methylotrophy, Pectobacterium pangenome, New Optical Coherence Tomography Biomarkers Identified with Deep Learning for Risk Stratification of Patients with Age-related Macular Degeneration, Virulence-related genes expression in planktonic mixed cultures of Candida albicans and non-albicans Candida species, Screening of Secondary Plant Metabolites on Antihelmintic Activity in Ascaris scum, Munich Cluster for Systems Neurology, Test project May 2024, Biospecimen Collection Protocol, Winter Wheat (Triticum aestivum L.) Grain Yield, Quality, and Net Photosynthesis When Grown Under Semi-Transparent Cadmium Telluride Photovoltaic Modules Near Maturity, Benefit for All FAIR Data, Implementation of Nanopore Sequencing for Detection of Treatment Induced Transcriptomic and Epitranscriptomic Changes in Leukaemic Tumour Models, DPL, Glycogen Metabolism in bacteria, ILS Ceramide Ring Trial, Project Test, DeepCurate, Revisiting mutational resistance to ampicillin and cefotaxime in Haemophilus influenzae, Cancer Systems Biology Consortium (CSBC), Biochemical characterization of the feedforward loop between CDK1 and FOXM1 in epidermal stem cells, Drug Discovery and Biotechnology Standard Operating Procedures, EDITH (Ecosystem Digital Twins in Health) test project, Fluid flow project, Smart Garden Watering System, The role of different fatty acids, AQUACIRCLE
Web page: Not specified
BioCreative VII
Programme: Independent Projects
Public web page: Not specified
Organisms: Not specified
Current chemical concept recognition tools have demonstrated significantly lower performance for in full-text articles than in abstracts. Improving automated full-text chemical concept recognition can substantially accelerate manual indexing and curation and advance downstream NLP tasks such as relevant article retrieval. Participating in BioCreative Track NLM-Chem we focus identifying chemicals in full-text articles (i.e. named entity recognition and normalization).
Submitter: Olga Krebs
Studies: Chemical named entity recognition and annotation
Assays: Usage of fine-tuned BioBERT for identification of chemical entities, Using semantic search in MesH and PubChem databases for entity linking
Snapshots: No snapshots
Chemical named entity recognition (NER) is a significant pre-processing task in natural language processing. Identification and extraction of chemical entities from biomedical literature and entities linking to the knowledge base are essential steps for the chemical text-mining pipeline. However, the identification of chemical entities in a biomedical text is a challenging task due to the diverse morphology of chemical entities and the different types of chemical nomenclature. In this work, we ...
Submitter: Olga Krebs
Investigation: Chemical Identification and Indexing in PubMed ...
Assays: Usage of fine-tuned BioBERT for identification of chemical entities, Using semantic search in MesH and PubChem databases for entity linking
Snapshots: No snapshots
BC7 NLM-Chem-track data and materials