Table Of Contents
1. Abstract. 4
2. Introduction. 5
3. The scientific problem 9
4. Materials and Methods. 9
4.1 MeSH (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db mesh) 10
4.2 Using databases 10
4.3 Using Literature Discovery Tools 13
4.3.1 Arrowsmith. 14
4.3.2 BioRAT 16
4.3.3 BITOLA 18
4.3.4 Manjal 19
4.3.5 LitLinker 20
5. Evaluation and Interpretation. 22
6. Results 24
7. Discussion. 29
8. References 33
9. Table of Figures 35
10. Abbrevations 36
11. Acknowledgements 37
3
1. Abstract
Undoubtedly, the development of Conceptual Biology, which uses text-mining applications specific to biology is the only way to cope with the increasing amount of free textual data produced in this field. The increasing interest of users in efficiently retrieving and extracting relevant information, the need to keep up with new discoveries described in the literature or in biological databases, and the demands posed by the analysis of high throughput experiments, are the underlying forces motivating the development of Conceptual Biology tools, such as text-mining applications in molecular biology. Therefore the methods of Conceptual Biology have been used for this study to test the hypothesis, that genetically modified foods have no impact on public health. We studied the records of databases and those ones of Literature Based Discovery tools. After the binary scoring of the records with respect to their usefulness, they were also classified by their positive, neutral or negative conclusions with respect to the effect of genetically modified food on public health. In conclusion, we have to deny the hypothesis and therefore to state that genetically modified foods have an impact on public health. Further studies in conceptual biology may focus what kind of impact genetically modified food has on public health.
4
2. Introduction
“Knowledge can be created by drawing inference from what is already known.” -Davies, R. 1989-
Theabundance of electronically accessible texts is rising exponentially throughout the last decade. A vast amount of digital information - especially in molecular biology and genetics- is seemingly an auspicious resource for conceptual biology. In the demands of biomedical or biochemical investigators for sources or references, librarians and information specialist are commonly puzzled. The increasing amount of scientific journals, with an even greater number of articles per journal, expands already in humongous bibliographic databases. The rapid and persistent augmentation in the number of biological, biomedical and even genetics publications gives rise to the desperate situation, that researchers can no longer read more than minute proportion of the literature in their field. Dealing with the substantial quantity of information has induced a fragmentation of scientific literature (Ganiz et al. 2005), that exists within:
1. specialities: advances in the research field e.g. modern comforts in biophysics or mathematical physics
2. sub-specialities: subordinated field in the research field e.g. proteomics or aquatic toxicology
3. structure: structure that occurs in the research field e.g. blood, cell or lipid 4. technique: special techniques that can be found in the research field e.g. mass spectrometry or gel electrophoresis
This specialisation or fragmentation of scientific literature leads to an insuperable border and furthermore to an increasing problem in science, particularly with regard to biomedicine (Swanson 2001).
Scientists incline the correspondence more within their fragments than with the scientific field´s farther community, enhancing the lack of communication between specialities (Swanson et al. 1997). This argument is proven within the citations of literature of authors, that cite heavily those of their own narrow specialities.
5
Thus, scientists may never be aware of the published data and results of others´ relevant work. In addition, this gives rise to useful and important connections between fragmented and implicit data, but yet unnoticed.
Figure 1. Swanson´s discovery. The relationships AB and BC are known and reported in the literature. The implicit relationship AC is a putative new discovery (Weeber et al. 2001) 1 Swanson´s serendipitous literature-based discovery of a cure for Raynaud´s didease by dietary fish oil. The literature for both issues were disjoint. If these scientific fields had been aware of each other, the cure would have been found much earlier than Swanson´s discovery
For this reason, conceptual biology allows one to encompass, without limitation as many fields of science as necessary. This is important because one cannot work experimentally in every scientific field. It is predicted that the most important changes in cellular and molecular biology will be conceptual. In turn it will be conceptual biology, supporting a need for data collection and phenomenological publications, but, and that is the most important thing, how these collections and publications are connected and related.
Hence, Informational Retrieval and the conventional computer-aided literature searching represent insufficient techniques for recognizing useful connections. Thus, this leads to Literature Based Discovery (LBD), that directly addresses the limits of knowledge discovery. LBD is a tool, that gives rise to occurrence of novel connections, that have yet not been published.
The concept´s principle is based on the hypothesis that “wealth of recorded knowledge is greater than the sum of its parts.” (Davies 1989). As a precursor in LBD, Don R. Swanson introduced in 1986 his concept of discovering new associations within a bibliographic database. Furthermore he has asserted hypothesis that have been published in various articles (Swanson 1986; 1988; 1990; Smalheiser & Swanson 1996, 1998). According to his statement LBD is a process, finding complementary structures in apparently disjoint literature.
1 In different works disease can be defined as A and substance can be defined as C. Thus, the search can either start from disease or substance.
6
These complementary structures derive from two discrete arguments, yielding novel and important inferences and insights, when combined. Those discrete arguments, that do not mention, cite or co-cite each other, are defined as “disjoint” arguments (Fig.1, p.7)
To specify the main purpose of LBD, I will focus the relevant traits:
1. LBD avails present knowledge from published science literature (e.g. Medline) 2. LBD is a process that strives to find relations between two disjoint arguments (e.g.” high blood viscosity” and “platelet aggregation” are mentioned arguments in separate literature of Fish Oil and Raynaud´s Disease) 3. The combination of these arguments may obtain a new non obvious insight 4. Any connection made should be novel and previously unpublished (e.g. no publication ever mentioned Fish Oil and Raynaud´s Disease together)
An intricacy of LBD, because it comprises two types of entities: concept and literature.
But there are three more serious obstacles for LBD. First, there is seeminlgy unmanageable information space with many potential relations due to the vast amount of data and text. Second, the language itself represents in an unstructured format with characteristic grammar and semantic - even more there embedded in different languages.
Third, the lack of standardized vocabulary inhibits the process to formally define various LBD techniques. Swanson´s example reflects, that we could discover new
7
knowledge from available existing text, if we can assemble the pieces of existing knowledge in the right way.
Conceptual Biology is not a common term in the field of biochemical and biomedical research. Indeed, it leads to important information, enveloped in the overwhelming multitude of literature. Thus, researchers could pose the question of a comprehensible definition of what conceptual biology is really about. Mikhail Blagosklonny hit the bull´s eye, when he says, that a conceptual biologist “…can generate a hypothesis in which predictions are formulated in testable terms, and then search for relevant information among published reports of experiments that may have had a different purpose altogether.” (Blagosklonny et al. 2002). In the common field of biochemical investigations, “one is not licensed to theorize without providing new data” (Blagosklonny et al. 2002), but according to D. Bray (Bray 2001) this “is a sociological problem and not a scientific one.” Furthermore conceptual biology is an important and irreplaceable complement to the accepted empirical biology in part, because researchers struggle to maintain expertise and management in their fields and even more to understand the connections between different research fields that could reveal fundamental new facts, embedded in the overproduced data field.
Hypothesis testing is central to the process of scientific discovery, and experimental design is a common methodology in the evidence gathering part of hypothesis testing. But some sort of what is commonly now known as ‘data mining’ methodologies can and always have been used too for the data gathering part of hypothesis testing. However, and maybe even ironically, data mining recently has been confined by the vast expansion of knowledge that is increasingly stored in specialized databases and formats as will be explained bellow. To avert these limitations, new developments have emerged in the form of what has been called Conceptual Biology.
With specialized tools and methodologies which are also explained below, Conceptual Biology allows seemingly unlikely hypothesis testing to be performed, examples of which will be given bellow also.
8
Arbeit zitieren:
Yvonne Papadopoulos, 2006, Testing a hypothesis with the methods of conceptual biology, München, GRIN Verlag GmbH
Dieser Text kann über folgende URL aufgerufen und zitiert werden:
Einbetten
DOI
Formatvorlage (Microsoft Word) für eine Diplomarbeit, Masterarbeit, Ha...
Für MS Word 2003 - Update 2010
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 25 Seiten
Formatvorlage (OpenOffice) für eine Diplomarbeit, Masterarbeit, Hausar...
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 35 Seiten
Formatvorlage / Vorlage zur Erstellung einer Diplomarbeit, Bachelorarb...
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 15 Seiten
Formatvorlage / Vorlage für eine Diplomarbeit / Hausarbeit
Für MS Word 2007 - dotx
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 25 Seiten
Anleitung zum Erstellen schriftlicher Arbeiten: Der Aufbau einer wisse...
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 20 Seiten
Erstellen einer schriftlichen Hausarbeit
Vorlagen, Muster, Formulare, Infobroschüren
Hausarbeit, 14 Seiten
Grundtechniken wissenschaftlichen Arbeitens
Bibliografieren - Reden - Schr...
Vorlagen, Muster, Formulare, Infobroschüren
Skript, 46 Seiten
Ratgeber zur Erstellung wissenschaftlicher Arbeiten. Diplomarbeiten - ...
Vorlagen, Muster, Formulare, Infobroschüren
Ausarbeitung, 39 Seiten
Yvonne Papadopoulos hat den Text Testing a hypothesis with the methods of conceptual biology veröffentlicht
Yvonne Papadopoulos hat einen neuen Text hochgeladen
Thwarting Consumer Choice: The Case Against Mandatory Labeling for Gen...
Gary E. Marchant, Guy A. Cardineau, Thomas P. Redick
Genetically Modified Food: A Short Guide for the Confused
A Short Guide for the Confused
Andy Rees
Market Development for Genetically Modified Foods
V. Santaniello, R. E. Evenson, D. Zilberman
0 Kommentare