OReFiL: an Online Resource Finder for Life Sciences
Overview
Authors
Affiliations
Background: Many online resources for the life sciences have been developed and introduced in peer-reviewed papers recently, ranging from databases and web applications to data-analysis software. Some have been introduced in special journal issues or websites with a search function, but others remain scattered throughout the Internet and in the published literature. The searchable resources on these sites are collected and maintained manually and are therefore of higher quality than automatically updated sites, but also require more time and effort.
Description: We developed an online resource search system called OReFiL to address these issues. We developed a crawler to gather all of the web pages whose URLs appear in MEDLINE abstracts and full-text papers on the BioMed Central open-access journals. The URLs were extracted using regular expressions and rules based on our heuristic knowledge. We then indexed the online resources to facilitate their retrieval and comparison by researchers. Because every online resource has at least one PubMed ID, we can easily acquire its summary with Medical Subject Headings (MeSH) terms and confirm its credibility through reference to the corresponding PubMed entry. In addition, because OReFiL automatically extracts URLs and updates the index, minimal time and effort is needed to maintain the system.
Conclusion: We developed OReFiL, a search system for online life science resources, which is freely available. The system's distinctive features include the ability to return up-to-date query-relevant online resources introduced in peer-reviewed papers; the ability to search using free words, MeSH terms, or author names; easy verification of each hit following links to the corresponding PubMed entry or to papers citing the URL through the search systems of BioMed Central, Scirus, HighWire Press, or Google Scholar; and quick confirmation of the existence of an online resource web page.
U-Index, a dataset and an impact metric for informatics tools and databases.
Callahan A, Winnenburg R, Shah N Sci Data. 2018; 5:180043.
PMID: 29557976 PMC: 5859919. DOI: 10.1038/sdata.2018.43.
A Survey of Bioinformatics Database and Software Usage through Mining the Literature.
Duck G, Nenadic G, Filannino M, Brass A, Robertson D, Stevens R PLoS One. 2016; 11(6):e0157989.
PMID: 27331905 PMC: 4917176. DOI: 10.1371/journal.pone.0157989.
Ozyurt I, Grethe J, Martone M, Bandrowski A PLoS One. 2016; 11(1):e0146300.
PMID: 26730820 PMC: 5156472. DOI: 10.1371/journal.pone.0146300.
Ambiguity and variability of database and software names in bioinformatics.
Duck G, Kovacevic A, Robertson D, Stevens R, Nenadic G J Biomed Semantics. 2015; 6:29.
PMID: 26131352 PMC: 4485340. DOI: 10.1186/s13326-015-0026-0.
bioNerDS: exploring bioinformatics' database and software use through literature mining.
Duck G, Nenadic G, Brass A, Robertson D, Stevens R BMC Bioinformatics. 2013; 14:194.
PMID: 23768135 PMC: 3693927. DOI: 10.1186/1471-2105-14-194.