Two biomedical sublanguages a description based on the theories of Zellig Harris
【精品】翻译综合
一个抑制肿瘤的连续模型-------艾丽斯H伯杰,阿尔弗雷德G. Knudson 与皮埃尔保罗潘多尔菲今年,也就是2011 年,标志着视网膜母细胞瘤的统计分析的第四十周年,首次提供了证据表明,肿瘤的发生,可以由两个突变发起。
这项工作提供了“二次打击”的假说,为解释隐性抑癌基因(TSGs)在显性遗传的癌症易感性综合征中的作用奠定了基础。
然而,四十年后,我们已经知道,即使是部分失活的肿瘤抑制基因也可以致使肿瘤的发生。
在这里,我们分析这方面的证据,并提出了一个关于肿瘤抑制基因功能的连续模型来全方位的解释肿瘤抑制基因在癌症过程中的突变。
虽然在1900 年之前癌症的遗传倾向已经被人认知,但是,是在19 世纪曾一度被忽视的孟德尔的遗传规律被重新发现之后,癌症的遗传倾向才更趋于合理化。
到那时,人们也知道,肿瘤细胞中的染色体模式是不正常的。
接下来对癌症遗传学的理解做出贡献的人是波威利,他提出,一些染色体可能刺激细胞分裂,其他的一些染色体 a 可能会抑制细胞分裂,但他的想法长期被忽视。
现在我们知道,这两种类型的基因,都是存在的。
在这次研究中,我们总结了后一种类型基因的研究历史,抑癌基因(TSGs),以及能够支持完全和部分失活的肿瘤抑制基因在癌症的发病中的作用的证据。
我们将抑制肿瘤的连续模型与经典的“二次打击”假说相结合,用来说明肿瘤抑制基因微妙的剂量效应,同时我们也讨论的“二次打击”假说的例外,如“专性的单倍剂量不足”,指出部分损失的抑癌基因比完全损失的更具致癌性。
这个连续模型突出了微妙的调控肿瘤抑制基因表达或活动的重要性,如微RNA(miRNA)的监管和调控。
最后,我们讨论了这种模式在┲⒌恼锒虾椭瘟乒 讨械挠跋臁!岸 未蚧鳌奔偎?第一个能够表明基因的异常可以导致癌症的发生的证据源自1960 年费城慢性粒细胞白血病细胞的染色体的发现。
后来,在1973 年,人们发现这个染色体是是第9 号和第22 号染色体异位的结果,并在1977 年,在急性早幼粒细胞白血病患者中第15 号和第17 号染色体易位被识别出来。
人类基因组计划名词解释生物信息学
人类基因组计划名词解释生物信息学英文回答:Bioinformatics.Bioinformatics is a field that combines biology, computer science, and information technology. It involves the development and use of computational tools and techniques to manage, analyze, and interpret biological data. Bioinformatics is used in a wide range of research areas, including genomics, proteomics, drug discovery, and disease diagnosis.Key concepts in bioinformatics.Genomics: The study of the structure and function of genomes.Proteomics: The study of the structure and function of proteins.Transcriptomics: The study of the structure and function of transcripts.Metabolomics: The study of the structure and function of metabolites.Bioinformatics databases: Databases that store and manage biological data.Bioinformatics tools: Software tools that are used to analyze and interpret biological data.Applications of bioinformatics.Drug discovery: Bioinformatics is used to identify new drug targets and to design new drugs.Disease diagnosis: Bioinformatics is used to develop new diagnostic tests for diseases.Personalized medicine: Bioinformatics is used todevelop personalized treatment plans for patients.Evolutionary biology: Bioinformatics is used to study the evolution of species.Challenges in bioinformatics.Data explosion: The amount of biological data is growing rapidly, making it difficult to manage and analyze.Data integration: Biological data is often stored in different formats and in different databases, making it difficult to integrate and analyze.Algorithm development: New algorithms are needed to analyze and interpret complex biological data.Despite these challenges, bioinformatics is a rapidly growing field with the potential to revolutionize the way we understand and treat diseases.中文回答:生物信息学。
双功能mrna名词解释
双功能mrna名词解释双功能mRNA的英文名称是"bifunctional mRNA",意思是具有双重功能的信使RNA。
1. This bifunctional mRNA encodes a protein that acts as both an enzyme and a regulatory factor. (这种双功能mRNA编码了一种既是酶又是调节因子的蛋白质。
)2. The discovery of bifunctional mRNA has shed light on the complex mechanisms of gene regulation. (对双功能mRNA的发现揭示了基因调控的复杂机制。
)3. Bifunctional mRNA plays a critical role in coordinating cellular processes by simultaneously carrying genetic information and executing enzymatic functions. (双功能mRNA通过同时携带遗传信息和执行酶功能,在协调细胞过程中起着关键作用。
)4. The identification of specific motifs within bifunctional mRNA has allowed scientists to study its dualroles in gene expression and enzymatic activity. (对双功能mRNA内特定结构的识别使科学家能够研究其在基因表达和酶活性中的双重作用。
)5. Researchers are investigating the therapeuticpotential of bifunctional mRNA in various diseases, aiming to utilize its dual functions for targeted treatment strategies. (研究人员正在探索双功能mRNA在各种疾病中的治疗潜力,旨在利用其双重功能进行靶向治疗策略。
英文外刊,抗击疟疾的科学家们,陷入了生物伦理学的争论
英文外刊,抗击疟疾的科学家们,陷入了生物伦理学的争论Scientists at this lab in Burkina Faso have deployed gene warfare against the parasite carrying mosquitoes that spread malaria.布基纳法索一个实验室的科学家已经对传播疟疾同时携带寄生虫的蚊子进行了基因改造。
The conventional tools at our disposal today have reached a ceiling and can't become more efficient than they are right now.我们现在使用的传统工具已经达到了极限,不能比现在的效率更高。
We have no choice but to look at complementary methods.我们别无选择,只能寻找辅助性疗法。
That is why we're using genetically modified mosquitoes.这就是我们对蚊子进行转基因的原因。
Professor Diabate runs the experiment for target malaria, a research consortium backed by the Bill and Melinda Gates Foundation.迪亚巴特教授为目标疟疾组织(比尔和梅琳达.盖茨基金会支持的研究联盟)开展了这项实验。
The group developed an enzyme that sterilizes male mosquitoes.研究小组研发出一种可以使雄蚊绝育的酶,可以使雄蚊绝育。
The action of the enzyme continues after fertilization which means if the male copulates with a female, the embryo is dead and the female can no longer have offspring.这种酶在雌蚊子受精后继续发挥作用,这意味着如果雄蚊子与雌蚊子交配,胚胎就会死亡,雌蚊子就不能再生育后代。
早、晚期双语者第二语言书写的脑功能定位及机制
万方数据万方数据圈lEAHP组英语书写与假写比较时引起脑葡萄糖代谢增加的脑表面图圈2LAHP组英语书写与假写比较时引起脑葡萄糖代谢增加的脑表面图注:红色区域示葡萄糖代谢激活的区域田3LAHP组与EAHP组英语书写与假写比较时引起脑葡萄糖代谢增加的脑表面图被动的角色,可发出信号到其他负责解决矛盾的调节网络,如额顶区等。
结合既往研究”引,本试验中两组L2书写均激活左扣带回,提示扣带回可能参与主动书写中的注意活动,起到矛盾调节的中转站。
试验结果揭示EAHP组L2书写激活双侧小脑。
LAHP组L2书写激活右侧小脑(t=21.97),激活程度明显高于EAHP组。
两组比较则激活双侧小脑,但较两组各自比较激活范围更广泛。
考虑小脑通过皮质-脑桥.小脑系统的前馈控制环路接受额、顶、枕、颞叶的投射纤维。
反过来,小脑通过小脑.丘脑.皮质系统将已接受幕上语言加工的信息反馈回皮层区,从而积极主动的参与语言的组织、构建和执行"J。
EAHP组和LAHP组L2书写均激活皮层下结构。
本试验英文书写内容为自我介绍,属于陈述性记忆;而书写本身又是一项技巧性的动作,属于程序性记忆。
因此除了负责陈述性记忆的颞顶区的激活外,还可以观察到负责程序性记忆的基底神经节和额叶的广泛激活。
而除了记忆与这些区域相关外,语言也与这些区域相关。
基底神经节可通过皮质一纹状体环路及其内部不同结构之间的纤维联系来参与语言的加工和产生,而既往研究也提供了基底神经节参与书写的证据[8。
111。
本研究尝试以书写为范式来观察语言习得年龄对L2脑功能定位的影响,EALP组均于4.5岁前学习英语,而LALP组在12岁以后接触英语。
EAHP组和LAHP组的进行L2书写时的脑功能定位既有相同之处又有不同之处,但LAHP组的脑区激活相对广泛(包括右半球)且相关区域脑代谢较EAHP组增高,如边缘叶、颞叶、小脑等脑区的激活,说明在语言熟练程度大致相同的情况下,语言习得年龄影响L2的脑功能定位的范围和程度。
汉字识别
Sublexical processing in visual recognition of Chinese characters:Evidence from repetition blindness forsubcharacter componentsSu-Ling Yeh *and Jing-Ling LiDepartment of Psychology,National Taiwan University,No.1,Sec.4,Roosevelt Rd.,Taipei 106,Taiwan,ROCAccepted 13May 2003AbstractRepetition blindness (RB)refers to the failure to detect the second occurrence of a repeated item in rapid serial visual presen-tation (RSVP).In two experiments using RSVP,the ability to report two critical characters was found to be impaired when these two characters were identical (Experiment 1)or similar by sharing one repeated component (Experiment 2),as opposed to when they were different characters with no common components.RB for the whole character occurred when the exposure duration was more than 50ms with one intervening character between the two critical characters (lag ¼1),whereas RB for subcharacter components was more evident at exposure durations shorter than 50ms with no intervening character (lag ¼0).These results provide support for the model of sublexical processing in Chinese character recognition.Ó2003Elsevier Science (USA).All rights reserved.Keywords:Visual word recognition;Chinese character;Component;Analytic processing;Repetition blindness;Sublexical processing1.IntroductionChinese characters are the basic writing units in Chinese script.They are different from linearly arranged alphabetic words in that each character consists of sev-eral parts combined into various structures,including left–right(e.g.,),up–down (e.g.,),P-shaped(e.g.,),L-shaped(e.g.,),and enclosed (e.g.,)structures (Yeh,Li,&Chen,1997;Yeh,Li,&Chen,1999;Yeh,2000),and in that each character occupies a constant square-shaped space,irrespective of its structure and the number of strokes in the character.These aspects of Chinese writing have led to the holistic processing view,as it would then seem more efficient to also treat each same-sized character as a perceptually distinct unit for visual recognition (e.g.,evidence from Chen,1984;Cheng,1981;Tzeng,Hung,Cotton,&Wang,1979;Yu,Feng,Cao,&Li,1990).Here we report,instead,evi-dence for an analytic processing view by adopting the repetition blindness paradigm (RB,Kanwisher,1987).RB is the failure to detect or report the second oc-currence of a repeated item in rapid serial visual pre-sentation (RSVP)when the repeated items are presented at a rate of 7–12item/s,with a temporal lag of 0–3in-tervening items.It has been shown that RB occurs for various stimulus types,including words presented in sentences and lists (Kanwisher,1987;Kanwisher &Potter,1990),letters presented in words,lists,and spa-tial arrays (Kanwisher,1991;Kanwisher &Potter,1990;Kanwisher,Driver,&Machado,1995;Park &Kanw-isher,1994),colors (Kanwisher,1991;Kanwisher et al.,1995),nonobject pictures (Arnell &Jolicoeur,1997),object pictures (Kanwisher,Yin,&Wojciulik,1999),sentences consisting of words &pictures (Bavelier,1994),and homophones (Bavelier &Potter,1992;Bavelier,Prasada,&Segui,1994).According to a leading hypothesis proposed by Kanwisher and her colleagues (the token individuation failure hypothesis ,Kanwisher,1987,1991;Kanwisher &Potter,1990;Park &Kanwisher,1994),RB results from*Corresponding author.Fax:+11-886-2-23629909.E-mail address:suling@.tw (S.-L.Yeh).0093-934X/$-see front matter Ó2003Elsevier Science (USA).All rights reserved.doi:10.1016/S0093-934X(03)00146-9Brain and Language 88(2004)47–53/locate/b<he failure to initiate two distinct episodic tokens from the same activated type within a short interval.When the first instance(critical item1,C11hereafter)has been individuated as a token related to a particular type(e.g., the word‘‘ink,’’presented in the5th position in RSVP), the token individuation of the second occurrence that belongs to the same type(e.g.,‘‘ink’’shown in the8th position)will be inhibited,causing‘‘blindness’’to the second,repeated item.Thus,the sentence‘‘When she spilled the ink there was ink all over’’may sometimes be reported as‘‘When she spilled the ink there was all over’’due to omission of the second token(C2)of the repeated type‘‘ink,’’though at the sacrifice of grammar(Kanw-isher,1987).In this view,a general processing limit imposed on the visual system has been proposed for the mechanism of RB,and the language deficit described above is simply a manifestation of such a limitation.Although this account of RB inevitably leads to the prediction that the occurrence of RB should be inde-pendent of language-related idiosyncrasies,whether that is correct,nevertheless,remains to be seen.Doubts arise because most studies of RB have been conducted with English words when verbal items were used.Though RB has also been reliably found for languages other than English,such as French(Bavelier et al.,1994),Spanish (Altarriba&Soltano,1996),and German(Kammer et al.,1998),these languages all employ alphabetic systems.As far as we know,there are no published findings or consensus as to whether Chinese characters, a logographic script,can also produce RB.We have found in a previous study(Yeh&Li,2001) some signs of RB for repeated subcharacter compo-nents.2In the particular experiment(Experiment1)that is relevant to the current study,two different Chinese characters were shown successively in an orthographic priming paradigm.When the target character had one component in common with the preceding prime char-acter,the target character was sometimes reported as the leftover component or a new compound character that combined the leftover component with another,unpre-sented component.Since the occurrence of RB depends on the identity of the two critical items,the existence of component RB indicates that subcharacter components are also treated in the mental lexicon as independent types during the course of character recognition. Therefore,thefinding of component RB,if confirmed, will be very intriguing,for it poses an important theo-retical question on the relative roles that the whole character and its constituent components play in Chi-nese character recognition.The primary goal of this study was to examine whether RB for Chinese characters and for subcharacter compo-nents indeed exists when a conventional RB paradigm is used(e.g.,Park&Kanwisher,1994).Establishing this is important,as the component RB reported in Yeh and Li (2001)was discovered quite by accident,and not under the conventional RB paradigm.In that experiment,only two characters were shown and an orthographic priming task was used.Also,there were no nonrepeated control pairs to provide the baseline comparison against which to evaluate the performance of the repeated condition when C2remained the same in the repeated and nonrepeated conditions,the importance of which has been stressed in recent RB literature(e.g.,Downing&Kanwisher,1995). For these reasons,we conducted two experiments in this study by following the RB paradigm to examine whether RB can also be found with Chinese characters.Two kinds of RB were explored here:one is character RB(C1and C2 were identical characters),and the other is component RB (C1and C2were similar in that they shared one common component).The second goal of this study was then to examine the relative time courses of character RB and component RB by manipulating the exposure duration of each item and the lag between C1and C2.It was predicted that both character RB and component RB should be observed,but under different conditions.Particularly, component RB should be associated with shorter expo-sure duration than character RB if there exists sublexical processing in Chinese character recognition.2.Experiment1To provide the basic measurement of RB with Chi-nese characters,repetition at the character level and at the subcharacter component level werefirst examined within the same experimental framework.Items pre-sented in RSVP included Chinese characters and sym-bols,and only the characters were to be reported.To reduce memory load so as to avoid possible confound-ing variables involved with memory,only two or three characters were shown in each RSVP sequence.For the repeated trials at the character level,C1and C2were identical characters.For the repeated trials at the com-ponent level,they were similar characters that had a repeated component.Their nonrepeated control condi-tions were constructed by replacing C1with another completely different character.C1and C2were sepa-rated by another character,namely,a lag-1condition.2.1.Method2.1.1.ParticipantsForty-eight native speakers of Mandarin Chinese,all undergraduates at National Taiwan University,partici-pated in this experiment in exchange for course credit.1We followed the convention in RB literature(e.g.,Kanwisher,1991)by labeling thefirst and second critical items as C1and C2,respectively.2By component we mean either the radical that conveys themeaning or the phonetic that carries the sound of the whole character.48S.-L.Yeh,J.-L.Li/Brain and Language88(2004)47–532.1.2.Materials and apparatusThe RSVP sequence consisted of2–3Chinese char-acters in the Si-Ming style and4–5symbols that can be easily discriminated from the characters,extended about 1.2°Â2.1°at a viewing distance of55cm.Thefixation was a‘‘+’’sign,roughly equal in size to the character. Only one stimulus was presented in each frame,and it was a white character displayed in the center of a black background.For each RSVP sequence,there were7 items in total,with thefirst and last positions beingfilled with symbols.C2was always in position6.C1was presented variably in positions2,3,or4.C1and C2 were separated by another character as well as1or2 symbols.The experiment was controlled by an IBM-compatible486MHz personal computer,and the stimuli were displayed on a20-inch Eizo color monitor with a refresh rate of70Hz.Normal lighting was provided in the experimental chamber.2.1.3.DesignFor the character condition,a list of48characters,in which C1and C2were identical characters,wasfirst built(the repeated–identical,R–I condition).Two other sets of characters were then established by either re-placing C1with a nonrepeated character(the nonre-peated–identical,N–I condition)or with a blank(the blank–identical,B–I condition).For the component condition,another set of48characters,in which C1and C2were different characters with one repeated compo-nent,was constructed(the repeated component,R–C condition),along with the associated nonrepeated con-trol condition(the N–C condition)and the blank con-dition(the B–C condition).For each set of R–I vs.N–I,as well as R–C vs.N–C, target and control characters were matched in their frequency of occurrence and number of strokes,Fs<1. The characters selected were listed in Tsai(1996)as having middle to high frequency of occurrence and from the two most frequently used structures,horizontal and vertical.Target and control characters did not neces-sarily have the same structure.No homophones or synonyms existed for the characters shown in each RSVP sequence.Also,no sequence was comprised of characters that formed a two-character or three-char-acter word.The six variants of each list(R–I,N–I,B–I,and R–C, N–C,and B–C)appeared in three different versions(i.e., a yoked design)and were counterbalanced between participants.Therefore,the three versions of experi-mental trials were presented to three groups of partici-pants,with16participants in each group viewing the same versions of96lists.The presentation order of the 96trials for each participant was completely random-ized.Each item was presented for57or86ms in RSVP, and the exposure duration was a between-subjects factor.2.1.4.ProcedureEach trial was initiated by the participantÕs pressing the space bar.Afixation plus wasfirst presented for 800ms,followed by the RSVP sequence containing the stimuli,and then followed by the samefixation plus, signaling the end of the trial.Participants were told that each trial contained either2or3characters,and they should write down all the characters they saw within each RSVP sequence.They were also told that some trials contained repeated characters,and if they saw the same character twice they should write down that character twice.Each participant conducted eight practice trials before the96formal trials.2.2.Results and discussionWe followed the conventional calculation of joint probability by counting the percentage of correct re-ports of C1and C2when both were correct,due to the fact that the exact order was difficult to decide when an item list such as ours was used(e.g.,Kanwisher et al., 1999;Kanwisher,1991).As the blank conditions(i.e., the B–I and B–C conditions)were simply used asfill-ers,data in these conditions were excluded from analysis.A three-way analysis of variance(ANOVA,Chen& Cheng,1999)was conducted on the factors of exposure duration(57vs.86ms),repetition(yes vs.no),and type (character ponent).The results showed a lower degree of accuracy in the repeated condition(74%)than in the nonrepeated condition(83%),Fð1;46Þ¼24:809, MSE¼4:233,p<:0001.Also,accuracy was lower in the character conditions(77%)than in the component conditions(83%),Fð1;46Þ¼35:198,MSE¼23:49, p<:0001.The accuracy was improved(from72to86%) as the exposure duration was lengthened(from57to 86ms),Fð1;46Þ¼30:711,MSE¼7:913,p<:0001.There were also interaction effects of exposure dura-tion and repetition(Fð1;46Þ¼4:429,MSE¼4:233, p<:05),as well as repetition and type(Fð1;46Þ¼14:817, MSE¼3:657,p<:0005).Further analysis of the inter-action effects indicated that there were simple main effects of repetition at exposure durations of57ms (Fð1;46Þ¼25:101,MSE¼4:233,p<:0001)and86ms (Fð1;46Þ¼4:136,MSE¼4:233,p<:05).The accuracy at57ms was lower than at86ms for the repeated condi-tion(Fð1;92Þ¼32:666,MSE¼6:073,p<:0001)and nonrepeated condition(Fð1;92Þ¼10:436,p<:005).In the repeated condition,accuracy in reporting the repeated character was lower than for the repeated component (Fð1;92Þ¼45:077,MSE¼3:003,p<:0001).For the character condition,accuracy for a repeated character was lower than for a nonrepeated character(Fð1;92Þ¼39:299,MSE¼3:945,p<:0001),indicating RB for characters.However,for the component condition,there was no significant difference in accuracy for a repeatedS.-L.Yeh,J.-L.Li/Brain and Language88(2004)47–5349component versus a nonrepeated component (F ð1;92Þ¼1:056,p ¼:307),indicating no RB for components.These trends of statistical analysis can be depicted as in Fig.1,which plots the percent correct of C1and C2as a function of exposure duration.As is clear from this figure,a general pattern can be seen:accuracy improves with exposure duration for all four conditions.More important,the accuracy in the R–I condition was lower than that in the N–I condition,indicating an effect of RB for identical characters.However,there is no indi-cation of component RB,as the accuracy in the R–C condition was not different from that in the N–C condition.Therefore,we have established the case for RB for Chinese characters,using the RB paradigm,by show-ing that repeated characters were more difficult to re-port than nonrepeated ones.The lack of component RB,however,seemed to be at odds with our previous results (Yeh &Li,2001).There are three possible explanations for the lack of component RB in this experiment.First,if components are processed first and then combined to form the whole character,the finding of character RB but not component RB in this experiment may be due to the fact that the range of exposure durations used was in favor of the whole character,but not the component.Such a trend may be more evident when repetition at both the character level and the component level is manipulated within the same experimental framework,which may,pre-sumably,increase the contrast between the two levels of processing.In the next experiment,only repetition at the component level was used,and the exposure du-ration was reduced to see whether component RB could then be observed.Second,in this experiment,C1and C2were separated by another character (i.e.,lag ¼1),which may also override the processing of components if it occurs very early,as expected by the analytic view.This is hinted at by the fact that in Yehand Li (2001)there was no lag between the two char-acters.Accordingly,a lag 0condition was also added in the next experiment to explore whether component RB was more likely to occur when no intervening characters were presented between the two critical characters.Third,component RB may be sensitive to the relative positions of the repeated components in the whole character.That is,character structure may im-pose a constraint on the occurrence of component RB.To avoid this possible confounding condition,C1and C2were then controlled to have the same structure;only horizontal characters were used in the next experiment.3.Experiment 23.1.Method3.1.1.ParticipantsForty-two undergraduates at National Taiwan Uni-versity participated in this experiment to earn extra course credits.None had participated in the previous experiment.3.1.2.Stimulus and procedureEach RSVP sequence consisted of 6items,including 2–3Chinese characters and 3–4symbols.We manipu-lated three factors:repetition (repeated,nonrepeated),lag (0,1),and exposure duration (29,43,and 71ms).Repetition and lag were within-subject factors,and du-ration was a between-subject factor.There were 26trials with repeated conditions and 26trials with nonrepeated conditions,all of which contained 3characters and 3symbols.C2in the repeated and nonrepeated conditions remained the same,and it was viewed by two groups of 7participants in each.Another 26filler trials,which contained 2characters and 4symbols,were constructed.C1and C2were both horizontal characters.They shared one repeated component in the repeated trials and had no common component in the nonrepeated trials.For the lag-1trials,C1and C2were separated by one character and one symbol,and the intervening charac-ters were vertically structured.For the lag-0trials,C1and C2were separated by a symbol.For the two char-acters in the filler trials,the first one was a vertical character,and the second,a horizontal character.The participants were asked to write down all the characters as they had seen in each RSVP.Note that in this ex-periment there was no repetition at the character level,so that even if the participants saw two different char-acters with a common component,they should not have trouble writing down the two characters (same as has been done for orthographic repetition blindness in En-glish,Harris &Morris,2000).Other details were the same as described in Experiment1.Fig.1.Percent correct of C1and C2at different exposure durations for repeated identical characters (R–I),repeated-components (R–C),and their control conditions (N–I and N–C)in Experiment 1.50S.-L.Yeh,J.-L.Li /Brain and Language 88(2004)47–533.2.Results and discussionThe percent correct of C1and C2as a function of ex-posure duration is shown in Fig.2.ANOVA showed that there were main effects of exposure duration (F ð2;18Þ¼8:177,MSE ¼0:069,p <:005),repetition (F ð1;18Þ¼47:080,MSE ¼0:015,p <:0001),and lag (F ð1;18Þ¼8:871,MSE ¼0:013,p <:01),as well as the interaction effect of repetition and lag (F ð1;18Þ¼8:364,MSE ¼0:020,p <:01).Planned comparison (Tukey test)showed that the difference between the accuracy at 29ms was lower than at 43and 71ms,ps <:01,but there was no difference between the accuracy at the latter two durations.Further analysis of the interaction effect of repetition and lag indicated that there were simple main effects of lag in the nonrepeated condition (F ð1;36Þ¼16:956,MSE ¼0:016,p <:0005),but not in the repeated con-dition (F <1).There were also simple main effects of repetition at lag 0(F ð1;36Þ¼44:813,MSE ¼0:017,p <:0001)and lag 1(F ð1;36Þ¼5:479,MSE ¼0:017,p <:05).This pattern of results thus demonstrates the existence of component RB.Also,it was more evident when there was no intervening character between the two critical characters and when each character was presented very briefly.Further analysis by subtracting the accuracy in the repeated condition from that in the nonrepeated condition (i.e.,RB)has confirmed this.There was a main effect of lag (F ð1;39Þ¼53:553,MSE ¼5:62,p <:0001),as well as an interaction effect of lag and duration (F ð2;39Þ¼6:882,MSE ¼5:62,p <:005).At the dura-tion of 29and 43ms,the difference in the magnitude of RB was significant (F ð1;39Þ¼39:661,MSE ¼5:62,p <:0001and F ð1;39Þ¼26:03,MSE ¼5:62,p <:0001,respectively).No significant difference was found when the exposure duration was 71ms (F ð1;39Þ¼1:627,MSE ¼5:62,p ¼:21).There was a simple main effect of du-ration at lag 0(F ð2;78Þ¼6:123,MSE ¼5:79,p <:005),but not at lag 1(F ð2;78Þ¼1:632,MSE ¼5:79,p ¼:219).At lag 0,the magnitude of component RB was higher at 29and 43ms than at 71ms,ps <:05.An advantage we had in our task was that we asked the participants to write down all the characters they saw in each trial;thus,the error patterns could be fur-ther analyzed to reveal meaningful information that may be disguised in verbal reports.In the erroneous trials,typical component RB was observed as follows.For example,when C1and C2wereand ,the participants wrotedown ,,or as C2.That is,the repeated component(in this case)was either omitted (thus leaving the leftover component,),or replaced by another component (thuscausingor to be re-ported).Note that in this example,all the misreported C2(,,and )are also existing characters.Similar examples can be seen,asinforand,forand .There were also examples in which the presumed leftover,nonrepeated component was not an actual character.In these cases,only erroneous actual characters formed by combining the leftover component with another component were observed,suchasfor,for ,and for .In addition to the statistical analysis comparing the accuracy in the re-peated condition vs.the nonrepeated condition,these examples provide further evidence to confirm the exis-tence of component RB.In fact,27.47%of the errone-ous responses fell into this category in the lag-0condition,and 9.89%in the lag-1condition.The majority of the erroneous responses were blank in the repeated condition.This also indicates possible com-ponent RB,as some participants reported that some-thing seemed to be there,but it was hard for them to remember what it was.The same situation did not occur in the nonrepeated trial,yet the same C2had been used in these two conditions.Since there were no identical characters in this experiment,the blank response for C2was clearly caused by the existence of the repeated component in C1.4.General discussionWe have demonstrated in this study that repeated Chinese characters were more difficult to detect than nonrepeated ones,a phenomenon of RB also found in Chinese logographs,indicating that RB is insensitive to language-related idiosyncrasies.This is important since written Chinese provides a salient contrast with other alphabetic systems,which have been used as the major stimulus materials in existing RB literature.More sig-nificantly,RB occurred not only for identical characters,but also for similar characters that shared one repeated component.These,taken together,constitute the first findings of both character RB and component RB in the same study by adopting the conventional RBparadigm.Fig.2.Percent correct of C1and C2at different exposure duration for repeated-component at lag 0(R-0)and lag 1(R-1),as well as their control conditions (N-0and N-1)in Experiment 2.S.-L.Yeh,J.-L.Li /Brain and Language 88(2004)47–5351Several methodological advantages in this study are worth mentioning.First,our success in observing RB for Chinese characters,especially for component RB, may be largely due to the low memory load in the recall of the RSVP sequence.Participants were required to report up to three characters in this study.This may be critical because a high memory load may interfere with the processing of components.The need to reduce memory load in order to observe the effect of repetition has also been observed and emphasized in the auditory counterpart of RB,repetition deafness(e.g.,Soto-Far-aco&Sebastian-Galles,2001).Second,by asking the participants to write down the characters,the erroneous pattern of misreported characters obtained in Experi-ment2can be used to further confirm the true existence of component RB,in addition to quantitative mea-surement by summing up all the incorrect responses as done conventionally.This may shed some light on the controversy of whether RB for similar words(e.g.,fish and dish)occurred at the word level(i.e.,by treating two similar words as the same type,Chialant&Caramazza, 1997)or at the letter level(i.e.,RB for repeated letters, Morris&Harris,1999).Our results obviously support the latter view.Besides the methodological caveats for observing RB with Chinese characters,the manipulations of exposure duration and lag in this study also manifest important theoretical points.The time windows for observing component RB and character RB were different,which indicate different processing speeds for components and for whole characters during visual character recognition. Combining the results of Experiment1and2,we can see that when the presentation speed was relatively slow (>50ms/item)and there was one intervening character, only RB for repeated character was found,whereas at higher speeds(<50ms/item)with no intervening char-acters,RB for subcharacter component was then ob-served.It thus seemed that components,though embedded within the whole character,had also been identified as independent types at an earlier stage than the identification of the whole character.This result is consistent with the view that sublexical processing occurred relatively early in Chinese charac-ter recognition(Chen,Allport,&Marshall,1996; Feldman&Siok,1999;Pollatsek,Tan,&Rayner, 2000;Taft&Zhu,1997;Zhou&Marslen-Wilson, 1999).Most noticeably,in the RB paradigm employed here,the participants were required to report the whole character,yet component RB still occurred,indicating that the subcharacter components had been processed as recognizable units even though this was not part of the task demand.This pattern of results therefore ar-gues against the holistic view that each character is treated as a unit of processing,without the need to go through the level of component processing(e.g.,Yu et al.,1990).ReferencesArnell,K.M.,&Jolicoeur,P.(1997).Repetition blindness for pseudoobject pictures.Journal of Experimental Psychology:Human Perception and Performance,23,999–1013.Altarriba,J.,&Soltano, E.G.(1996).Repetition blindness and bilingual memory:Token individuation for translation equivalents.Memory and Cognition,24,700–711.Bavelier,D.(1994).Repetition blindness between visually different items:the case of pictures and words.Cognition,51,199–236. Bavelier,D.,&Potter,M.C.(1992).Visual and phonological codes in repetition blindness.Journal of Experimental Psychology:Human Perception and Performance,18,134–147.Bavelier,D.,Prasada,S.,&Segui,J.(1994).Repetition blindness between words:Nature of the orthographic and phonological representations involved.Journal of Experimental Psychology: Learning,Memory and Cognition,20,1437–1455.Chialant,D.,&Caramazza,A.(1997).Identity and similarity factors in repetition blindness:Implications for lexical process.Cognition, 63,79–119.Chen,H.C.(1984).Character detection in reading Chinese:Effects of context and display format.Chinese Journal of Psychology,26,29–34.Chen,H.C.,&Cheng,C.M.(1999).ANOVA and trend analysis statistical program for cognitive experiment.Research in Applied Psychology,1,229–246.Chen,Y.P.,Allport,D.A.,&Marshall,J.C.(1996).What are the functional orthographic units in Chinese word recognition.The stroke or stroke pattern?The Quarterly Journal of Experimental Psychology,49A,1024–1043.Cheng, C.M.(1981).Perception of Chinese characters.Chinese Journal of Psychology,23,137–153.Downing,P.,&Kanwisher,N.(1995).Types and tokens unscathed;A reply to Whittlesea,Dorken,and Podrouzek(1995)and Whittlesea and Podrouzek(1995)..Journal of Experimental Psychology: Learning,Memory and Cognition,21,1698–1702.Feldman,L.B.,&Siok,W.W.T.(1999).Semantic radicals contribute to the visual identification of Chinese characters.Journal of Memory and Language,40,559–576.Harris, C.L.,&Morris, A.L.(2000).Orthographic repetition blindness.The Quarterly Journal of Experiment Psychology,53A, 1039–1060.Kammer,T.,Saleh, F.,Oepen,G.,Manschreck,T.,Seyyedi,S., Kanwisher,N.,Furmanski,C.,&Spitzer,M.(1998).Repetition blindness in schizophrenic patients.European Archives of Psychi-atry and Clinical Neurosciences,248,136–140.Kanwisher,N.G.,Yin,C.,&Wojciulik,E.H.(1999).Repetition Blindness for Pictures:Evidence for the Rapid Computation of Abstract Visual Descriptions.In V.Coltheart(Ed.),Fleeting memories(pp.119–150).Cambridge,MA:MIT Press. Kanwisher,N.(1987).Repetition blindness:type recognition without token individuation.Cognition,27,117–143.Kanwisher,N.(1991).Repetition blindness and illusory conjunctions: Errors in binding visual types with visual tokens.Journal of Experimental Psychology:Human Perception and Performance,17, 402–421.Kanwisher,N.,&Potter,M.(1990).Repetition blindness:Levels of processing.Journal of Experimental Psychology:Human Perception and Performance,16,30–47.Kanwisher,N.,Driver,J.,&Machado,L.(1995).Spatial repetition blindness is modulated by selective attention to color or shape.Cognitive Psychology,29,303–337.Morris, A.L.,&Harris, C.L.(1999).A sublexical locus for repetition blindness:Evidence from illusory words.Journal of Experimental Psychology:Human Perception and Performance, 17,404–421.52S.-L.Yeh,J.-L.Li/Brain and Language88(2004)47–53。
当代研究生英语 第七单元 B课文翻译
价格的利润生物公司正在吞噬可改变动物DNA序列的所有专利。
这是对阻碍医学研究发展的一种冲击。
木匠认为他们的贸易工具是理所当然的。
他们买木材和锤子后,他们可以使用木材和锤子去制作任何他们所选择的东西。
多年之后来自木材厂和工具储藏室的人并没有任何进展,也没有索要利润份额。
对于那些打造明日药物的科学家们来说,这种独立性是一种罕见的奢侈品。
发展或是发现这些生物技术贸易中的工具和稀有材料的公司,对那些其他也用这些工具和材料的人进行了严格的监控。
这些工具包括关键基因的DNA序列,人类、动物植物和一些病毒的基因的部分片段,例如,HIV,克隆细胞,酶,删除基因和用于快速扫描DNA样品的DNA 芯片。
为了将他们这些关键的资源得到手,医学研究人员进场不得不签署协议,这些协议可以制约他们如何使用这些资源或是保证发现这些的公司可以得到最终结果中的部分利益。
许多学者称这抑制了了解和治愈疾病的进程。
这些建议使Harold得到了警示,Harold是华盛顿附近的美国国家卫生研究院的院长,在同年早期,他建立了一个工作小组去调查此事。
由于他的提早的调查,下个月出就能发布初步的报告。
来自安阿伯密歇根大学的法律教授,该工作组的主席Rebecea Eisenberg说,她们的工作组已经听到了好多研究者的抱怨,在它们中有一份由美国联合大学技术管理组提交的重量级的卷宗。
为了帮助收集证据,NIH建立了一个网站,在这个网站上研究者们可以匿名举报一些案件,这些案件他们相信他们的工作已经被这些限制性许可证严重阻碍了。
迫使研究人员在出版之前需要将他们的手稿展示给公司的这一保密条款和协议是投诉中最常见的原因之一。
另一个问题是一些公司坚持保有自动许可证的权利,该许可证是有关利用他们物质所生产的任何未来将被发现的产品,并且这些赋予他们对任何利用他们的工具所赚取的利润的支配权利的条款也有保有的权利。
Eisenberg说:“如果你不得不签署了许多这样的条款的话,那真的是一个大麻烦”。
谷歌生物医学专用翻译全篇
谷歌生物医学专用翻译
1 Google生物医学专用翻译
Google生物医学专用翻译(Google Bio-Medical Translation)
是Google推出的一项新服务,它的主要目的是帮助研究生物医学领域
的医学和科学家能够跨越语言壁垒,更加便捷地使用由他们创新的科
技成果和其它相关信息。
Google Bio-Medical Translation通过一个自然语言处理(NLP)、机器学习算法和深度学习模型融合而成,构建出一系列生物医学翻译
路线图上精通计算机科学和生物医学研究知识的专家。
基于此,
Google生物医学专用翻译准确地将英文生物医学文献中的概念专用语
译成非英语,以及将非英语的概念专用语译回英语,并能够在生物医
学内容的英汉双语文本中实现端到端的翻译。
Google Bio-Medical Translation把机器学习、深度学习等不同
技术结合起来,从各个层面打造了一款有效率、可靠、高准确率的生
物医学专业翻译工具,为生物医学研究、分析和应用提供了新的技术
手段,提高了生物医学研究的质量,增进了理解和运用该领域的知识。
此外,Google生物医学专用翻译不仅能有效地促进和抓住生命科学信
息的传播,也可以促进不同语言的务实合作,推动全球生命科学的可
持续发展。
Google Bio-Medical Translation让医学研究得到更多的国际支持,避免因语言差异造成的错误和停滞,助力更多生物医学研究者实
现突破,为临床医学、数字医学和公共卫生等领域提供了一种创新而有效的解决方案,助力全球健康事业独占鳌头。
基于高斯混合模型的生物医学领域双语句子对齐
(no mainRer v l b r tr If r t ti a La oao y,Dain Unv riyo c n lg o e l iest f a Teh oo y,Dain,Lio ig 1 6 2 l a a nn 1 0 4,Chn ) ia
A b ta t A ii ua e c n ofbim e c lt r s l y n m p t nt o e i o e c l r s —a sr c : blng llxio o dia e m p a s a i ora r l n bim dia c o s lngu ge i o m a in a nf r to r t iv 1 Se t nc i m e s t e fr tsep t uid a biigua e ion The Ga sa i ur o la r s e e re a. n e eal gn nti h is t O b l ln ll x c . us in m xt e m de nd tan f r
基 于高 斯 混合 模 型 的生 物 医学领 域 双语 句 子 对 齐
陈 相 ,林 鸿 飞 ,杨 志 豪
( 连 理 工 大 学 信 息 检 索 研 究 室 , 宁 大连 l6 2 ) 大 辽 1 0 4 摘
一
要 :双语 术语 词 典 在 生 物 医学跨 语 言 检 索 系统 中有 着 非 常 重 要 的 地 位 , 而双 语 句子 对 齐 是 构 建 双 语 词 典 的 第
征 进 行 训 练 , 得 模 型在 测 试语 料 上 句 子 对 齐 的 正 确 率得 到较 大ห้องสมุดไป่ตู้ 高 。 使
关 键 词 :计 算机 应 用 ; 中文 信 息 处 理 ; 句子 对 齐 ; 高斯 混 合 模 型 ; 迁移 学 习 ; 信 息 锚
英语语言学名词解释补充
Chapter 11 : Second Language Acquisition1. second language acquisition:It refers to the systematic study of how one person acquires a second language subsequent to his native language.2. target language: The language to be acquired by the second language learner.3. second language:A second language is a language which is not a native language in a country but which is widely used as a medium of communication and which is usually used alongside another language or languages.4. foreign language:A foreign language is a language which is taught asa school subject but which is not used as a medium of instruction in schools nor as a language of communication within a country.5. interlanguage: A type of language produced by second and foreign language learners, who are in the process of learning a language, and this type of language usually contains wrong expressions.6. fossilization: In second or foreign language learning, there is a process which sometimes occurs in which incorrect linguistic features become a permanent part of the way a person speaks or writes a language.7. contrastive analysis: a method of analyzing languages for instructional purposes whereby a native language and target language are compared with a view to establishing points of difference likely to cause difficulties for learners.8. contrastive analysis hypothesis: A hypothesis in second language acquisition. It predicts that where there are similarities between the first and second languages, the learner will acquire second language structure with ease, where there are differences, the learner will have difficulty.9. positive transfer:It refers to the transfer that occur when both the native language and the target language have the same form, thus making learning easier. (06F)10. negative transfer:the mistaken transfer of features of one’s native language into a second language.11. error analysis: the study and analysis of errors made by second and foreign language learners in order to identify causes of errors or common difficulties in language learning.12. interlingual error:errors, which mainly result from cross-linguistic interference at different levels such as phonological, lexical, grammatical etc.13. intralingual error:Errors, which mainly result from faulty or partial learning of the target language, independent of the native language. The typical examples are overgeneralization and cross-association.14. overgeneralization:The use of previously available strategies in new situations, in which they are unacceptable.15. cross-association: some words are similar in meaning as well as spelling and pronunciation. This internal interference is called cross-association.16. error: the production of incorrect forms in speech or writing by a non-native speaker of a second language, due to his incomplete knowledge of the rules of that target language.17. mistake:mistakes, defined as either intentionally or unintentionally deviant forms and self-corrigible, suggest failure in performance.18. input: language which a learner hears or receives and from which he or she can learn.19. intake: the input which is actually helpful for the learner.20. Input Hypothesis:A hypothesis proposed by Krashen , which states that in second language learning, it’s necessary for the learner to understand input language which contains linguistic items that are slightly beyond the learner’s present linguistic competence. Eventually the ability to produce language is said to emerge naturally without being taught directly.21. acquisition: Acquisition is a process similar to the way children acquire their first language. It is a subconscious process without minute learning of grammatical rules. Learners are hardly aware of their learning but they are using language to communicate. It is also called implicit learning, informal learning or natural learning.22. learning: learning is a conscious learning of second language knowledge by learning the rules and talking about the rules.23. comprehensible input:Input language which contains linguistic items that are slightly beyond the learner’s present linguistic competence. (06F)24. language aptitude: the natural ability to learn a language, not including intelligence, motivation, interest, etc.25. motivation: motivation is defin ed as the learner’s attitudes and affective state or learning drive.26. instrumental motivation: the motivation that people learn a foreign language for instrumental goals such as passing exams, or furthering a career etc. (06C)27. integrative motivation:the drive that people learn a foreign language because of the wish to identify with the target culture. (06C/ 05)28. resultative motivation: the drive that learners learn a second language for external purposes. (06F)29. intrinsic motivation: the drive that learners learn the second language for enjoyment or pleasure from learning.30. learning strategies:learning strategies are learners’ conscious goal-oriented and problem-solving based efforts to achieve learning efficiency.31. cognitive strategies: strategies involved in analyzing, synthesis, and internalizing what has been learned. (07C/ 06F)32. metacognitive strategies:the techniques in planning, monitoring and evaluating one’s learning.33. affect/ social strategies: the strategies dealing with the ways learners interact or communicate with other speakers, native ornon-native.Chapter 12 : Language And Brain1. neurolinguistics: It is the study of relationship between brain and language. It includes research into how the structure of the braininfluences language learning, how and in which parts of the brain language is stored, and how damage to the brain affects the ability to use language.2. psycholinguistics: the study of language processing. It is concerned with the processes of language acqisition, comprehension and production.3. brain lateralization: The localization of cognitive and perceptive functions in a particular hemisphere of the brain.4. dichotic listening:A technique in which stimuli either linguistic or non-linguistic are presented through headphones to the left and right ear to determine the lateralization of cognitive function.5. right ear advantage: The phenomenon that the right ear shows an advantage for the perception of linguistic signals id known as the right ear advantage.6. split brain studies: The experiments that investigate the effects of surgically severing the corpus callosum on cognition are called as split brain studies.7. aphasia: It refers to a number of acquired language disorders due to the cerebral lesions caused by a tumor, an accident and so on.8. non-fluent aphasia:Damage to parts of the brain in front of the central sulcus is called non-fluent aphasia.9. fluent aphasia: Damage to parts of the left cortex behind the central sulcus results in a type of aphasia called fluent aphasia.10. Acquired dyslexia: Damage in and around the angular gyrus of the parietal lobe often causes the impairment of reading and writing ability, which is referred to as acquired dyslexia.11. phonological dyslexia:it is a type of acquired dyslexia in which the patient seems to have lost the ability to use spelling-to-sound rules.12. surface dyslexia: it is a type of acquired dyslexia in which the patient seems unable to recognize words as whole but must process all words through a set of spelling-to-sound rules.13. spoonerism:a slip of tongue in which the position of sounds, syllables, or words is reversed, for example, Let’s have chish and fips instend of Let’s have fish and chips.14. priming: the process that before the participants make a decision whether the string of letters is a word or not, they are presented with an activated word.15. frequency effect: Subjects take less time to make judgement on frequently used words than to judge less commonly used words . This phenomenon is called frequency effect.16. lexical decision: an experiment that let participants judge whethera string of letter is a word or not at a certain time.17. the priming experiment:An experiment that let subjects judge whethera string of letters is a word or not after showed with a stimulus word, called prime.18. priming effect:Since the mental representation is activated through the prime, when the target is presented, response time is shorter that it otherwise would have been. This is called the priming effect. (06F)19. bottom-up processing: an approach that makes use principally of information which is already present in the data.20. top-down processing:an approach that makes use of previous knowledge and experience of the readers in analyzing and processing information which is received.21. garden path sentences: a sentence in which the comprehender assumesa particular meaning of a word or phrase but discovers later that the assumption was incorrect, forcing the comprehender to backtrack and reinterpret the sentence.22. slip of the tongue:mistakes in speech which provide psycholinguistic evidence for the way we formulate words and phrases.。
中科院博士研究生英语精读教材翻译及原文整理解读
第1课知识的悖论The Paradox of KnowledgeThe greatest achievement of humankind in its long evolution from ancient hominoid ancestors to its present status is the acquisition and accumulation of a vast body of knowledge about itself, the world, and the universe. The products of this knowledge are all those things that, in the aggregate, we call "civilization," including language, science, literature, art, all the physical mechanisms, instruments, and structures we use, and the physical infrastructures on which society relies. Most of us assume that in modern society knowledge of all kinds is continually increasing and the aggregation of new information into the corpus of our social or collective knowledge is steadily reducing the area of ignorance about ourselves, the world, and the universe. But continuing reminders of the numerous areas of our present ignorance invite a critical analysis of this assumption.In the popular view, intellectual evolution is similar to, although much more rapid than, somatic evolution. Biological evolution is often described by the statement that "ontogeny recapitulates phylogeny"--meaning that the individual embryo, in its development from a fertilized ovum into a human baby, passes through successive stages in which it resembles ancestral forms of the human species. The popular view is that humankind has progressed from a state of innocent ignorance, comparable to that of an infant, and gradually has acquired more and more knowledge, much as a child learns in passing through the several grades of the educational system. Implicit in this view is an assumption that phylogeny resembles ontogeny, so that there will ultimately be a stage in which the accumulation of knowledge is essentially complete, at least in specific fields, as if society had graduated with all the advanced degrees that signify mastery of important subjects.Such views have, in fact, been expressed by some eminent scientists. In 1894 the great American physicist Albert Michelson said in a talk at the University of Chicago:While it is never safe to affirm that the future of Physical Science has no marvels in store even more astonishing than those of the past, it seems probable that most of the grand underlying principles have been firmly established and that further advances are to be sought chiefly in the rigorous application of these principles to all the phenomena which come under our notice .... The future truths of Physical Science ate to be looked for in the sixth place of decimals.In the century since Michelson's talk, scientists have discovered much more than the refinement of measurements in the sixth decimal place, and none is willing to make a similar statement today. However, many still cling to the notion that such a state of knowledge remains a possibility to be attained sooner or later. Stephen Hawking, thegreat English scientist, in his immensely popular book A Brief History of Time (1988), concludes with the speculation that we may "discover a complete theory" that "would be the ultimate triumph of human reason--for then we would know the mind of God." Paul Davies, an Australian physicist, echoes that view by suggesting that the human mind may be able to grasp some of the secrets encompassed by the title of his book The Mind of God (1992). Other contemporary scientists write of "theories of everything," meaning theories that explain all observable physical phenomena, and Nobel Laureate Steven Weinberg, one of the founders of the current standard model of physical theory, writes of his Dreams of a Final Theory (1992).Despite the eminence and obvious yearning of these and many other contemporary scientists, there is nothing in the history of science to suggest that any addition of data or theories to the body of scientific knowledge will ever provide answers to all questions in any field. On the contrary, the history of science indicates that increasing knowledge brings awareness of new areas of ignorance and of new questions to be answered.Astronomy is the most ancient of the sciences, and its development is a model of other fields of knowledge. People have been observing the stars and other celestial bodies since the dawn of recorded history. As early as 3000 B.C. the Babylonians recognized a number of the constellations. In the sixth century B.C., Pythagoras proposed the notion of a spherical Earth and of a universe with objects in it chat moved in accordance with natural laws. Later Greek philosophers taught that the sky was a hollow globe surrounding the Earth, that it was supported on an axis running through the Earth, and chat stars were inlaid on its inner surface, which rotated westward daily. In the second century A.D., Ptolemy propounded a theory of a geocentric (Earth-centered) universe in which the sun, planets, and stars moved in circular orbits of cycles and epicycles around the Earth, although the Earth was not at the precise center of these orbits. While somewhat awkward, the Ptolemaic system could produce reasonably reliable predictions of planetary positions, which were, however, good for only a few years and which developed substantial discrepancies from actual observations over a long period of time. Nevertheless, since there was no evidence then apparent to astronomers that the Earth itself moves, the Ptolemaic system remained unchallenged for more than 13 centuries.In the sixteenth century Nocolaus Copernicus, who is said to have mastered all the knowledge of his day in mathematics, astronomy, medicine, and theology, became dissatisfied with the Ptolemaic system. He found that a heliocentric system was both mathematically possible and aesthetically more pleasing, and wrote a full exposition of his hypothesis, which was not published until 1543, shortly after his death. Early inthe seventeenth century, Johannes Kepler became imperial mathematician of the Holy Roman Empire upon the death of Tycho Brahe, and he acquired a collection of meticulous naked-eye observations of the positions of celestial bodies chat had been made by Brahe. On the basis of these data, Kepler calculated that both Ptolemy and Copernicus were in error in assuming chat planets traveled in circular orbits, and in 1609 he published a book demonstrating mathematically chat the planets travel around the sun in elliptical orbits. Kepler's laws of planetary motion are still regarded as basically valid.In the first decade of the seventeenth century Galileo Galilei learned of the invention of the telescope and began to build such instruments, becoming the first person to use a telescope for astronomical observations, and thus discovering craters on the moon, phases of Venus, and the satellites of Jupiter. His observations convinced him of the validity of the Copernican system and resulted in the well-known conflict between Galileo and church authorities. In January 1642 Galileo died, and in December of chat year Isaac Newton was born. Modern science derives largely from the work of these two men.Newton's contributions to science are numerous. He laid the foundations for modem physical optics, formulated the basic laws of motion and the law of universal gravitation, and devised the infinitesimal calculus. Newton's laws of motion and gravitation are still used for calculations of such matters as trajectories of spacecraft and satellites and orbits of planets. In 1846, relying on such calculations as a guide to observation, astronomers discovered the planet Neptune.While calculations based on Newton's laws are accurate, they are dismayingly complex when three or more bodies are involved. In 1915, Einstein announced his theory of general relativity, which led to a set of differential equations for planetary orbits identical to those based on Newtonian calculations, except for those relating to the planet Mercury. The elliptical orbit of Mercury rotates through the years, but so slowly that the change of position is less than one minute of arc each century. The equations of general relativity precisely accounted for this precession; Newtonian equations did not.Einstein's equations also explained the red shift in the light from distant stars and the deflection of starlight as it passed near the sun. However, Einstein assumed chat the universe was static, and, in order to permit a meaningful solution to the equations of relativity, in 1917 he added another term, called a "cosmological constant," to the equations. Although the existence and significance of a cosmological constant is still being debated, Einstein later declared chat this was a major mistake, as Edwin Hubble established in the 1920s chat the universe is expanding and galaxies are receding fromone another at a speed proportionate to their distance.Another important development in astronomy grew out of Newton's experimentation in optics, beginning with his demonstration chat sunlight could be broken up by a prism into a spectrum of different colors, which led to the science of spectroscopy. In the twentieth century, spectroscopy was applied to astronomy to gun information about the chemical and physical condition of celestial bodies chat was not disclosed by visual observation. In the 1920s, precise photographic photometry was introduced to astronomy and quantitative spectrochemical analysis became common. Also during the 1920s, scientists like Heisenberg, de Broglie, Schrodinger, and Dirac developed quantum mechanics, a branch of physics dealing with subatomic particles of matter and quanta of energy. Astronomers began to recognize that the properties of celestial bodies, including planets, could be well understood only in terms of physics, and the field began to be referred to as "astrophysics."These developments created an explosive expansion in our knowledge of astronomy. During the first five thousand years or more of observing the heavens, observation was confined to the narrow band of visible light. In the last half of this century astronomical observations have been made across the spectrum of electromagnetic radiation, including radio waves, infrared, ultraviolet, X-rays, and gamma rays, and from satellites beyond the atmosphere. It is no exaggeration to say chat since the end of World War II more astronomical data have been gathered than during all of the thousands of years of preceding human history.However, despite all improvements in instrumentation, increasing sophistication of analysis and calculation augmented by the massive power of computers, and the huge aggregation of data, or knowledge, we still cannot predict future movements of planets and other elements of even the solar system with a high degree of certainty. Ivars Peterson, a highly trained science writer and an editor of Science News, writes in his book Newton's Clock (1993) that a surprisingly subtle chaos pervades the solar system. He states:In one way or another the problem of the solar system's stability has fascinated and tormented asrtonomers and mathematicians for more than 200 years. Somewhat to the embarrassment of contemporary experts, it remains one of the most perplexing, unsolved issues in celestial mechanics. Each step toward resolving this and related questions has only exposed additional uncertainties and even deeper mysteries.Similar problems pervade astronomy. The two major theories of cosmology, general relativity and quantum mechanics, cannot be stated in the same mathematical language, and thus are inconsistent with one another, as the Ptolemaic and Copernicantheories were in the sixteenth century, although both contemporary theories continue to be used, but for different calculations. Oxford mathematician Roger Penrose, in The Emperors New Mind (1989), contends that this inconsistency requires a change in quantum theory to provide a new theory he calls "correct quantum gravity."Furthermore, the observations astronomers make with new technologies disclose a total mass in the universe that is less than about 10 percent of the total mass that mathematical calculations require the universe to contain on the basis of its observed rate of expansion. If the universe contains no more mass than we have been able to observe directly, then according to all current theories it should have expanded in the past, and be expanding now, much more rapidly than the rate actually observed. It is therefore believed that 90 percent or more of the mass in the universe is some sort of "dark matter" that has not yet been observed and the nature of which is unknown. Current theories favor either WIMPs (weakly interacting massive particles) or MACHOs (massive compact halo objects). Other similar mysteries abound and increase in number as our ability to observe improves.The progress of biological and life sciences has been similar to that of the physical sciences, except that it has occurred several centuries later. The theory of biological evolution first came to the attention of scientists with the publication of Darwin's Origin of Species in 1859. But Darwin lacked any explanation of the causes of variation and inheritance of characteristics. These were provided by Gregor Mendel, who laid the mathematical foundation of genetics with the publication of papers in 1865 and 1866.Medicine, according to Lewis Thomas, is the youngest science, having become truly scientific only in the 1930s. Recent and ongoing research has created uncertainty about even such basic concepts as when and how life begins and when death occurs, and we are spending billions in an attempt to learn how much it may be possible to know about human genetics. Modern medicine has demonstrably improved both our life expectancies and our health, and further improvements continue to be made as research progresses. But new questions arise even more rapidly than our research resources grow, as the host of problems related to the Human Genome Project illustrates.From even such an abbreviated and incomplete survey of science as this, it appears that increasing knowledge does not result in a commensurate decrease in ignorance, but, on the contrary, exposes new lacunae in our comprehension and confronts us with unforeseen questions disclosing areas of ignorance of which we were not previously aware.Thus the concept of science as an expanding body of knowledge that will eventually encompass or dispel all significant areas of ignorance is an illusion. Scientists and philosophers are now observing that it is naive to regard science as a process that begins with observations that are organized into theories and are then subsequently tested by experiments. The late Karl Popper, a leading philosopher of science, wrote in The Growth of Scientific Knowledge (1960) chat science starts from problems, not from observations, and chat every worthwhile new theory raises new problems. Thus there is no danger that science will come to an end because it has completed its task, clanks to the "infinity of our ignorance."At least since Thomas Kuhn published The Structure of Scientific Revolutions (1962), it has been generally recognized that observations are the result of theories (called paradigms by Kuhn and other philosophers), for without theories of relevance and irrelevance there would be no basis for determining what observations to make. Since no one can know everything, to be fully informed on any subject (a claim sometimes made by those in authority) is simply to reach a judgment that additional data are not important enough to be worth the trouble of securing or considering.To carry the analysis another step, it must be recognized that theories are the result of questions and questions are the product of perceived ignorance. Thus it is chat ignorance gives rise to inquiry chat produces knowledge, which, in turn, discloses new areas of ignorance. This is the paradox of knowledge: As knowledge increases so does ignorance, and ignorance may increase more than its related knowledge.My own metaphor to illustrate the relationship of knowledge and ignorance is based on a line from Matthew Arnold: "For we are here as on a darkling plain...." The dark chat surrounds us, chat, indeed, envelops our world, is ignorance. Knowledge is the illumination shed by whatever candles (or more technologically advanced light sources) we can provide. As we light more and more figurative candles, the area of illumination enlarges; but the area beyond illumination increases geometrically. We know chat there is much we don't know; but we cannot know how much there is chat we don't know. Thus knowledge is finite, but ignorance is infinite, and the finite cannot ever encompass the infinite.This is a revised version of an article originally published in COSMOS 1994. Copyright 1995 by Lee Loevinger.Lee Loevinger is a Washington lawyer and former assistant attorney general of the United States who writes frequently for scientific c publications. He has participated for many years as a member, co-chair, or liaison with the National Conference of Lawyers and Scientists, and he is a founder and former chair of the Science andTechnology Section of the American Bar Association. Office address: Hogan and Hartson, 555 Thirteenth St. NW, Washington, DC 20004.人类从古类人猿进化到当前的状态这个长久的进化过程中的最大成就是有关于人类自身、世界以及宇宙众多知识的获得和积聚。
bioinformatics with editor和with reviewer变换
bioinformatics with editor和with reviewer变换Bioinformatics is a rapidly growing field that combines biology and computer science to analyze and interpret biological data. Within this field, there are two primary roles that play a crucial role in the development and dissemination of research findings - the editor and the reviewer. Both of these roles are essential in ensuring the quality and validity of scientific research.Editors in the field of bioinformatics are responsible for overseeing the publication process of research articles. They play a crucial role in selecting manuscripts for publication and ensuring that they meet the standards of the journal. Editors are typically experts in the field and have a deep understanding of the latest advancements and techniques in bioinformatics. They evaluate the significance and novelty of the research, as well as the overall quality of the manuscript.The first step in the editorial process is the submission of a research manuscript by the authors. Editors then review the initial submission to determine if it meets the journal's scope and criteria. If the manuscript is deemed suitable for further evaluation, it is sentout for peer review. The editor identifies potential reviewers who possess the necessary expertise to assess the scientific validity and rigor of the research presented in the manuscript.The role of the reviewer is to critically evaluate the manuscript and provide feedback to the editor. Reviewers are typically researchers or experts in the field who have a deep understanding of the topic being investigated. They assess the methodology, data analysis, and interpretation of the results to ensure that they are accurate and supported by the presented evidence.Reviewers also assess the clarity of the manuscript and provide constructive feedback on how it can be improved. They may suggest additional experiments, analysis, or clarifications to strengthen the research. The feedback from the reviewer is anonymous and confidential, allowing them to provide unbiased assessments of the manuscript.Once the review process is complete, the editor carefully considers the feedback provided by the reviewers. They evaluate the strengths and weaknesses of the manuscript and determine whether it should be accepted, revised, or rejected. If revisions arerequired, the editor communicates the reviewer's feedback to the authors and provides them with an opportunity to address the concerns raised.Authors then revise their manuscript based on the reviewer's feedback and resubmit it for re-evaluation. The editor evaluates the revised manuscript to ensure that the author's responses adequately address the concerns raised by the reviewers. If the editor is satisfied with the revisions, the manuscript may proceed to the final stages of production, including copyediting, proofreading, and formatting.The role of the editor continues throughout the publication process, ensuring that the manuscript is formatted correctly and that it adheres to the journal's guidelines. Editors are also responsible for coordinating with the authors and reviewers to address any remaining concerns or questions.In conclusion, both the editor and the reviewer play crucial roles in the publication process of bioinformatics research. The editor oversees the entire process, from initial submission to finalpublication, ensuring that the research meets the journal's standards. Reviewers, on the other hand, assess the scientific validity and rigor of the research, providing feedback to improve the manuscript. Together, the editor and reviewer ensure the quality and integrity of bioinformatics research findings.。
蛋白质 自然语言处理
蛋白质自然语言处理摘要:1.蛋白质的定义和作用2.自然语言处理的发展3.蛋白质与自然语言处理的关系4.蛋白质在自然语言处理中的应用前景正文:蛋白质是生命体内最为重要的分子之一,它在细胞中扮演着关键的角色,包括结构支持、酶催化、信号传导等。
近年来,随着科学技术的进步,蛋白质研究逐渐深入到各个领域,其中之一就是自然语言处理。
自然语言处理(Natural Language Processing, NLP)是计算机科学和人工智能领域中的一个重要分支,它研究如何让计算机理解和处理人类语言。
自20世纪50年代以来,自然语言处理经历了多次发展浪潮,如今已经取得了显著的成果,例如机器翻译、情感分析、语音识别等。
蛋白质与自然语言处理之间存在着密切的联系。
首先,蛋白质的序列决定了它的结构和功能,而自然语言处理技术可以帮助科学家分析和理解蛋白质的序列信息。
通过深度学习、模式识别等方法,研究人员可以挖掘蛋白质序列中的规律,进一步揭示蛋白质的结构和功能。
此外,蛋白质在自然语言处理中的应用也取得了重要进展。
例如,利用蛋白质结构预测算法,可以推测出蛋白质的三维结构,从而为自然语言处理中的词向量表示提供依据。
同时,蛋白质的相互作用网络也可以用于构建自然语言处理模型,提高模型的准确性和鲁棒性。
在未来,随着蛋白质研究的不断深入,自然语言处理技术将会在蛋白质领域发挥更大的作用。
通过蛋白质和自然语言处理技术的相互促进,有望在生物信息学、药物设计等领域取得重大突破。
总之,蛋白质与自然语言处理之间存在着紧密的联系。
蛋白质为自然语言处理提供了丰富的研究素材,而自然语言处理技术则为蛋白质研究提供了强大的工具。
BA and BP
姓在前,名在后, 姓与名之间用逗号 隔开
AUTHOR INDEX (Personal or Corporate Names)
页码
Name Adams Research Study Group………………………….. Glanzman, David L……………………………………… Hempel, Arnold T,Jr……………………………………... Malereau, Marie-Pierre………………………………….. Mc Dermott, J.R…………………………………………. Murphy , Geoffrey G……………………………………... Ref. No 2468 9890 1256 3345 10323 9890
应的页码
文摘部分
文摘部分是BA的主要部分, 文摘部分是 的主要部分,其总体的编排 的主要部分 顺序是按主要概念词等级表顺序排列,每一主要 顺序是按主要概念词等级表顺序排列, 概念词下均有参照项—see also,列出与该类目 概念词下均有参照项 , 相关的其他主要概念词语, 相关的其他主要概念词语,同一概念词语下的文 献按文摘号排列, 献按文摘号排列,在BA中文献条目的文摘号是唯 中文献条目的文摘号是唯 一的,基本上每一条文献都提供摘要。 一的,基本上每一条文献都提供摘要。
著者索引( 著者索引(Author Index) )
著者索引部分的著者包括个人著者 团体著 个人著者、团体著 个人著者 者,主要用于查找已知的特定个人名字或团 体名称的文献资料。 著者索引按个人著者和团体著者姓名的英文 英文 字母顺序排列,并且对每一位著者或合著者 字母 都建立了独立的索引条目。
著录格式
语种,摘要 带“*”著者所 在单位和地址 摘要 文献篇名
I2B2总结
I2B2总结摘要:2010 i2b2/VA关于自然语言处理挑战临床记录的研讨会提出了三个任务:一个概念提取任务,从病人报告中提取医学概念;一个断言分类任务,对医学概念分别归类;一个关系分类任务,分配在医疗问题,测试和治疗之间的关系类型。
本文先简要介绍三个任务的描述,然后在概念提取、断言分类和关系分类的一般的研究方法上,介绍三个任务对应的主要研究方法。
0 介绍2010 年 i2b2与盐湖城卫生保健局组织标注了一系列机构的电子病历数据,并且在此基础之上组织了电子病历领域的信息抽取的评测(2010 i2b2/VA challenge)[28].概念抽取被设计为一个信息抽取任务,识别并提取与患者医疗问题(problem)、治疗(treament)和测试(test)相对应的文本。
断言分类,它的目标是把医学概念(模拟为疾病)归类到病人当前患有该疾病(present),没有该疾病(absent),可能患有该疾病(possible),病人只在某些情况下才会有该疾病(Conditional),患者可能会发展到该疾病(Hypothetical),该疾病与病人无关(Not associated with the patient)。
关系分类,旨在从一个句子中按照给定参考标准概念对关系分类,分类标准如下:1.医疗问题与治疗的关系:▪TrIP:治疗改善了医疗问题▪TrWP:治疗恶化的医疗问题▪TrCP: 治疗导致医疗问题▪ TrAP: 治疗管理医疗问题▪ TrNAP: 治疗因医疗问题而不被管理2.医疗问题与测试的关系:▪TeRP:测试显示医疗问题▪TeCP: 测试进行以调查医疗问题3.医疗问题与医疗问题的关系:▪PIP:医疗问题表明医疗问题1 注释格式1.1 概念抽取格式输入一段病人报告文本,每个概念抽取输出的结果格式如下:c = “概念文本”偏移 ||t = “概念类型”其中c表示一个概念的提及。
概念文本将替换为报表中的实际文本;偏移量表示跨越概念文本的开始和结束行和单词编号。
双语术语提取算法
双语术语提取算法双语术语提取算法(Bilingual Terminology Extraction Algorithm)引言:随着全球化的加深,多语言信息处理的需求也越来越迫切。
在这种背景下,双语术语提取算法成为了研究的热点之一。
通过提取两种语言中的术语,可以帮助人们更好地理解不同语言之间的关联,从而促进跨语言交流和信息处理的效率。
一、双语术语提取算法的定义双语术语提取算法是指通过对两种语言的文本进行分析和处理,从中提取出两种语言共有的术语。
这些术语是在不同领域中广泛使用的专业词汇,对于深入了解特定领域的文本非常重要。
1. 数据预处理在进行双语术语提取之前,首先需要对两种语言的文本数据进行预处理。
这包括去除标点符号、停用词等无关信息,并进行分词和词性标注等处理。
2. 术语候选项提取接下来,通过使用词频、互信息等统计方法,从预处理后的文本数据中提取出术语的候选项。
这些候选项是潜在的术语,需要进一步的筛选和验证。
3. 术语筛选与验证在候选项中,可能存在一些不是真正的术语,因此需要进行筛选与验证。
常用的方法包括基于词性、词义、语境等特征的术语识别算法。
这些算法可以帮助我们确定哪些候选项是真正的术语。
4. 双语术语对齐在确定了两种语言中的术语后,还需要对这些术语进行对齐。
通过比较两种语言中术语的相似性,可以找到它们之间的对应关系。
这个过程需要使用双语词典、翻译模型等工具。
5. 术语评估与优化需要对提取出的双语术语进行评估与优化。
可以使用专家评价、领域知识等方法来判断提取的术语是否正确和完整,并对算法进行改进和优化。
三、双语术语提取算法的应用领域双语术语提取算法在多个领域有着广泛的应用。
例如,在机器翻译中,通过提取源语言和目标语言中的术语,可以帮助改善翻译质量。
在自然语言处理中,双语术语提取可以用于构建双语词典、术语库等资源,为其他任务提供基础支持。
双语术语提取算法在跨语言信息检索、知识图谱构建、专业领域信息抽取等方面也有着重要的应用。
医学英语口语:基因专业词汇AB(写写帮整理)
医学英语口语:基因专业词汇AB(写写帮整理)第一篇:医学英语口语:基因专业词汇AB(写写帮整理)医学英语口语:基因专业词汇A/BA腺苷脱氨酶缺乏症adenosine deaminase deficiency(ADA)腺病毒adenovirusAlagille综合征Alagille syndrome等位基因allele氨基酸amino acids动物模型animal model抗体antibody凋亡apoptosis路-巴综合征 ataxia-telangiectasia常染色体显性 autosomal dominant常染色体autosomeB细菌人工染色体bacterial artificial chromosome(BAC)碱基对base pair先天缺陷 birth defect骨髓移植 bone marrow transplantationBRCA1/BRCA2原文来自必克英语第二篇:医学英语口语基因专业词汇OP医学英语口语: 基因专业词汇O/PO寡核苷酸 oligo癌基因 oncogenePp53Parkinson病 Parkinson's disease专利权 patent血系/谱系 pedigree表型 phenotype物理图谱 physical map多指畸形/多趾畸形 polydactyly聚合酶链反应polymerase chain reaction(PCR)多态性polymorphism定位克隆 positional cloning原发性免疫缺陷 primary immunodeficiency引物 primer原核 pronucleus前列腺癌 prostate cancer蛋白 protein原文来自必克英语第三篇:医学英语口语:基因专业词汇EFG医学英语口语:基因专业词汇E/F/GE电泳 electrophoresisEllis-van Creveld syndrome酶 enzyme外显子 exonF家族性地中海热 familial Mediterranean fever荧光原位杂交 fluorescence in situ hybridization(FISH)脆性X染色体综合征 Fragile X syndromeG基因 gene基因扩增 gene amplification基因表达 gene expression基因图谱 gene mapping基因库 gene pool基因治疗 gene therapy基因转移 gene transfer遗传密码 genetic code(ATGC)遗传咨询 genetic counseling遗传图 genetic map遗传标记 genetic marker遗传病筛查 genetic screening基因组 genome基因型 genotype种系 germ line原文来自必克英语第四篇:医学英语口语基因专业词汇RS医学英语口语: 基因专业词汇R/SR隐性 recessive逆转录病毒 retrovirus核糖核酸 ribonucleic acid(RNA)核糖体 ribosomerisk communicationS序列标记位点 sequence-tagged site(STS)联合免疫缺陷 severe combined immunodeficiency(SCID)性染色体 sex chromosome伴性的 sex-linked体细胞 somatic cellsDNA印记 Southern blot光谱核型 spectral karyotype(SKY)替代 substitution自杀基因 suicide gene综合征 syndrome原文来自必克英语第五篇:医学英语口语基因专业词汇MN[范文]医学英语口语: 基因专业词汇M/NM畸形 malformation描图 mapping标记 marker黑色素瘤 melanoma孟德尔 Mendel, Johann(Gregor)孟德尔遗传 Mendelian inheritance信使RNA messenger RNA(mRNA)[分裂]中期 metaphase微阵技术 microarray technology线立体DNA mitochondrial DNA单体性 monosomy小鼠模型 mouse model多发性内分泌瘤病 multiple endocrine neoplasia, type 1(MEN1)突变 mutationN神经纤维瘤病 neurofibromatosis尼曼-皮克病Niemann-Pick disease, type C(NPC)non-directivenessRNA印记 Northern blot核苷酸 nucleotide神经核 nucleus原文来自必克英语。
科学家成功创造新的DNA来制造“外星”生命形式
科学家成功创造新的DNA来制造“外星”生命形式如字面上的意思,在科学家成功地创造人造DNA,第一次改变生命密码之后,他们就快要创造出新的“外星”生命形态。
打开连尚头条,看更多新鲜猛料当科技进步,科学家得以达成先前认为不可能的事。
现在,在科学家第一次扩展基因密码之后,他们就快要创造出新的生命形态。
美国科学对常见的大肠杆菌进行基因工程,加入在自然状态下不存在于菌中的分子。
在地球生命漫长的历史中,生命的密码由DNA“程序化”,DNA 由四个原则字母组成:G(鸟嘌呤)、T (胸腺嘧啶)、C(胞嘧啶)、A(腺嘌呤)。
这些分子与DNA螺旋相合,最终给我们专属的独特密码。
然而,科学家扩展了密码,加入两个新的分子,分别称为X 和Y,创造出一个全新的生命形态。
这样做的理由是?答案是药。
这个革新计画背后的专家们表示,他们已经创造出人造DNA,为了让细菌制造出新的蛋白质形态,这种新形态最终可以用于不同的医疗用途。
说到突破性的成就,美国加州的斯克利普斯研究所(Scripps Research Institute)的首席科学家 Romesberg 博士说:“你的基因组必须稳定,你的人生才能长久。
如果半合成有机体要成为一个真正的器官,它必须能维持稳定。
地球上的生物多样性都是由两对DNA编码出来的,也就是 A-T 和 C-G 碱基对,而我们所做的就是创造一个除了拥有这两对外,还有非自然的第三对碱基对的有机体,并且是以稳定的状态存在。
这显示其他种储存信息的方法是可能的,也让我们更接近一个扩展DNA的生物学,这种生物学可以有许多令人兴奋的应用,从新药到新的纳米技术。
”但整个程序并不在这里结束。
事实上,根据刊登在美国科学学院院刊《Proceedings of the National Academy of Sciences》的研究,下一个步骤是证明那个人造、非自然的DNA可以被转录成细菌的RNA分子,这可以让研究人员拥有数不清的可能性,让他们控制细菌的未来行动。
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
Two biomedical sublanguages:a description basedon the theories of Zellig HarrisCarol Friedman,a,b,*Pauline Kra,a and Andrey Rzhetsky a,caDepartment of Medical Informatics,Columbia University,VC5,Vanderbilt Building,622West 168th Street,New York,NY 10032-3720,USAbDepartment of Computer Science,Queens College CUNY,65-30Kissens Blvd.,Flushing,NY 11367,USAcGenome Center,Columbia University,1150St.Nicholas Blvd.,Russ Berrie Pavilion,Informatics,Room 121H,New York,NY 10032,USAReceived 15August 2002AbstractNatural language processing (NLP)systems have been developed to provide access to the tremendous body of data and knowledge that is available in the biomedical domain in the form of natural language text.These NLP systems are valuable because they can encode and amass the information in the text so that it can be used by other automated processes to improve patient care and our understanding of disease processes and treatments.Zellig Harris proposed a theory of sublanguage that laid the foundation for natural language processing in specialized domains.He hypothesized that the informational content and structure form a specialized language that can be delineated in the form of a sublanguage grammar.The grammar can then be used by a language processor to capture and encode the salient information and relations in text.In this paper,we briefly summarize his language and sublanguage theories.In addition,we summarize our prior research,which is associated with the sublanguage grammars we developed for two different biomedical domains.These grammars illustrate how Harris Õtheories provide a basis for the development of language processing systems in the biomedical domain.The two domains and their associated sublanguages discussed are:the clinical domain,where the text consists of patient reports,and the biomolecular domain,where the text consists of complete journal articles.Ó2003Elsevier Science (USA).All rights reserved.1.IntroductionWith the constantly increasing availability of online textual information and computational power,we ex-perience increased utilization of natural language pro-cessing (NLP)techniques in the biomedical domain.In the 2001and 2002AMIA Fall Symposiums there were 10and 14papers,respectively,on natural language processing as compared to a handful of papers in ses-sions before 1998.Similarly,the Pacific Symposium on Biocomputing (PSB)has experienced an increased in-terest in the use of NLP for mining the literature for knowledge acquisition and improving retrieval of the literature.The NLP sessions at the 2001,2002,and 2003PSB conferences attracted 10,17,and 15submissions,respectively,from which 4,6,and 5,respectively,were accepted.In addition,in 2002and 2003the Associationof Computational Linguistics held the first two work-shops on biomedical language processing.NLP methodology has been used to obtain and structure clinical and biomolecular information.Diverse NLP clinical applications have been developed to be used for decision support [1,2],encoding [3–6],data mining and clinical research [7–9],order entry [10],in-formation retrieval [11–13],and controlled vocabulary [14].A number of evaluations of these applications demonstrated they were effective for realistic clinical applications.Additionally,NLP has been used to im-prove access to the biomedical literature.In the last few years,a substantial effort has been associated with identifying biomolecular substances.One type of system primarily identifies gene or protein names in biological texts,[15–18],and others extract relations between the substances in order to automatically acquire knowledge and to improve retrieval of information [19–26].Zellig Harris proposed a theory of sublanguages [27,28]that explains why it is possible to process lan-guage in specialized textual domains,such asthoseJournal of Biomedical Informatics 35(2002)222–235/locate/yjbin*Corresponding author.Fax:1-212-305-3302.E-mail address:friedman@ (C.Friedman).1532-0464/02/$-see front matter Ó2003Elsevier Science (USA).All rights reserved.doi:10.1016/S1532-0464(03)00012-1found in genomics and medicine.According to Harris, the languages of technical domains have a structure and regularity,which can be observed by examining the corpora of the domains,and which can be delineated so that the structure can be specified in a form suitable for computation.Whereas the general English grammar theory primarily specifies well-formed syntactic struc-tures only,HarrisÕs sublanguage grammar theory also incorporates domain-specific semantic information and relationships to delineate a language that is more in-formative than English because it reflects the subject matter and relations of a domain as well as the syntactic structure.The scientific grounding of HarrisÕs sublanguage theory is well established and has been repeatedly veri-fied by the vast amount of work that has been done in this area.A set of papers on sublanguage processing and research collected by Grishman and Kittridge[29]in-cludes the domains of lipoprotein kinetics[29]clinical patient reports[30],telegraphic navy messages[31,32], and reporting of events in outer space[33].Additional work pertaining to the sublanguages of pharmacological literature and lipid metabolism is described in Sager[34]. An earlier collection of papers on this subject was edited by Kittredge and Lehrberger[35].Wefind that HarrisÕs principles are applicable to much of our work in bio-medical language processing.In this paper we describe properties of the languages of two specialized domains in biomedicine,the clinical domain and the biomolecu-lar domain,and show how Harris provides a linguistic foundation for our work.In the next section we provide background material by summarizing important aspects of HarrisÕs principles on language and sublanguage.We then present thefirst major use of those principles,which resulted in the de-velopment of a comprehensive sublanguage grammar and medical language processing system by Sager[36] who heads the Linguistic String Project.The subsequent section analyzes features of the language of the clinical domain,which is followed by a section on the language of the biomolecular domain.Finally,we discuss differ-ences and similarities of the two sublanguages,and provide our conclusion.2.Background2.1.An overview of Harris’theory on language and sublanguageHarris postulated that all occurrences of language are word sequences satisfying certain constraints which ex-press and transmit information.The constraints are: dependency relations[28;54–61],paraphrastic reduc-tions[28;79–96],and inequalities of likelihood[28;61–79].Additionally,certain subsets of languages exist(i.e.,sublanguages)within specialized domains that exhibit specialized constraints due to limitations of the words and relations of the subject matter[28;272–318].Dependency relations are concerned with syntactic regularities,and are applicable to all general language as well as to specialized languages.The crucial property is the dependence of words in a sentence on other words, and the categorization of words accordingly.Basically, words that are nouns or concrete objects are considered zero-level words(e.g.,cats,fish)because they do not depend on other words in the sentence.More specifi-cally,zero-level words occur with other words that are first or second level words.In contrast,words that are verbs are eitherfirst-level or second-level words,which are considered to be operators that are dependent on their arguments.Sentences containing afirst-level word (i.e.,eat as in cats eatfish)must contain zero-level words that are arguments of thefirst-level words.Likewise second-level words(e.g.,knows)have at least one ar-gument that is afirst-level word.For example,in John knows cats eatfish,the object argument of knows is a sentence containing afirst-level operator eats,which has two zero-level arguments cats andfish.This language component is concerned with classes of words but not with individual words,and permits strange or unlikely combinations,such as computers eatfish,as long as the dependency constraints are met.Inequalities of likelihood.The dependency relations exhibit different likelihood constraints.For example, certain arguments are more likely to occur with certain operators than with others.Thus,cat is more likely to occur as thefirst argument of eat than table or concept is.The likelihoods of an operator in respect to particular arguments are based on the frequency of operator-ar-gument combinations;some combinations occur fre-quently whereas others occur very rarely.In general language,the likelihood constraints on operators and their arguments are fuzzy,while in sublanguages the constraints are generally sharper.In either case,com-binations that have a very high likelihood create a low information situation,in which case zeroing of highly likely words may occur.For example,in general lan-guage,the indefinite noun something is often zeroed,as in I ate and I read,without loss of information.In the clinical language there is no loss of information if the noun patient and verb has is zeroed,as in fever.Paraphrastic reductions involve transformations of the sentences from a simple primitive form(which we typi-cally do not observe in text)to a complex form that consists of the actual sentences we see in textual docu-ments(i.e.,called the surface form).These reductions are paraphrastic in that they are associated with operations that change the structure of the sentence without changing the informational content.These reductions serve the purpose of eliminating information that is highly likely to occur and that is redundant by shorteningC.Friedman et al./Journal of Biomedical Informatics35(2002)222–235223and combining the original sentences so that they rep-resent a more efficient and compact form.For example, the sentence John has a brown hat,is reduced from a single sentence John has a hat;the hat is brown.When looking at natural language text,we do not see the original sentences because they generally have under-gone numerous transformations to become the surface forms that constitute the sentences we do see in text.Sublanguage grammar.When we look at sentences within a specialized domain,it is possible to observe particular word classes and particular statement types where the statement types generally contain operators that are much more restrictive than the dependency re-lations permitted in the general language and the like-lihood constraints are much more definitive.For example,in a general language it is permissible,al-though not sensible,to say John activated protein A, because the syntactic combination of word classes is well-formed.However,in the biomolecular language domain,this sentence is not legitimate because the op-erator activate permits only certain combinations of the word classes(i.e.,substance activate substance,process activate substance are allowed but person activate sub-stance is not),and the sublanguage operators reflect the salient relations and arguments that are meaningful in the specialized domain.Specialized sublanguages deal with specific subject matter(i.e.,textual radiological reports,discharge sum-maries or other patient documents,biomolecular litera-ture,medical literature,etc.).In this paper,we focus on two sublanguages within the biomedical domain:clinical reports and biomolecular relations found in the literature. The specifications of these sublanguages resulted in the implementation of two sublanguage grammars,which were used to process text and to extract and structure relevant information.It was possible to create these grammars because of the regularities and co-occurrence relations within each of the specialized sublanguages. Immunology literature is also in the biomedical domain, and Harris discussed the immunology sublanguage as presented in a companion article in the same issue of this journal.A more extensive analysis of the immunology sublanguage is discussed by Harris et al.[37].In the grammar of a specialized sublanguage,oper-ators and arguments still satisfy the dependency rela-tions of the whole language and paraphrastic reductions still occur,but the vocabulary is limited,only restricted combinations of words occur,and subclasses of words combine in specified ways with other subclasses.In a sublanguage,words form subsets from the larger word classes of the overall language.In the biomolecular domain,subsets of classes can be identified which cor-respond to words denoting proteins,genes,cells,tissues, and other biomolecular substances that constitute the relevant objects corresponding to the subject matter of the domain.Moreover,words that do not belong to relevant word classes of the domain(e.g.,pencil,desk) are excluded from the sublanguage.We can also form subsets from other classes,such as the verb class,that depend on the subclasses of the arguments that co-occur with the verbs.For example,the combination Fyn ac-tivates Cbl is a well-formed pattern because the sequencePROTEIN ACTIVATE PR OTEIN occurs regularly,but the combination liver cells bind to protein B(CELL ATTACHPROTEIN)is not allowed since that particular combi-nation of word classes never occurs.Thus,in order to create a sublanguage grammar,the critical task is to discover the subclasses and important relations.For each domain,clustering techniques[38] help to discover a limited number of word classes and sentence types for a large sample of a domain corpus. However,the sentences are in surface forms,and therefore,many reductions have occurred so that the sentences are complex and not necessarily in forms close to the underlying operator-argument forms,making the discovery task more difficult.There are several other elements of HarrisÕs theory concerning sublanguages.A sublanguage may differ from the whole language by omitting some grammatical properties of the language or by allowing different re-ductions.The domain-specific syntactic constraints and reductions are not necessarily the same as those of general English.We have observed this feature within the clinical domain because many well-formed sentences are telegraphic,in that they often are missing subjects and verbs,because that information is implicit in the context.For example,in a section of a report corre-sponding to chief complaints,sentences may consist of noun phrases only.In this context a noun phrase,such as pain,has an operator,such as is associated with the patient,which is expected in this context and therefore can be omitted.Another interesting observation is that sublanguages may overlap because they are associated with some of the same events or entities.For instance,the clinical and biomolecular domains are concerned with tissues and diseases,but the underlying relationships associated with them differ substantially.That may imply that similarities and differences in sublanguages or overlap-ping scientificfields may be quantifiable by measuring differences and similarities in words classes and their membership in co-occurrence patterns.3.Sublanguage featuresIn the following,we discuss features of languages in specialized domains that have important implications for the development of computerized natural language processing systems:•Semantic categorization of words.Relevant words can be categorized into subclasses or types of information224 C.Friedman et al./Journal of Biomedical Informatics35(2002)222–235where the types form the underlying subject matter of the domain.For example in the clinical domain there are informational categories,such as:disease,proce-dure,body location,and medication,whereas in the biomolecular domain some relevant categories are: gene,protein,amino acid,small molecules,and inter-action.•Co-occurrence patterns and constraints.Particular subclasses of information combine in particular co-occurrence patterns to form the meaningful relations of the domain.In the clinical domain a relation such as,PATIENT VERB have SYMPTOM IN BODYLOCATION(e.g.,patient experienced pain in joints)specifies that apatient is experiencing a symptom in a body location.In the biomolecular domain,a relation,such as protein1VERB interaction protein2(e.g.,Fyn activated Cbl),would be appropriate,whereas the former pat-tern most likely would not be.In addition,it is possi-ble to refine a class,such as INTERACTION,to further constrain the arguments it combines with.For exam-ple,if we create two subclasses of INTERACTION, called ACTIVATE and ATTACH,then we can refine the well-formed patterns so that PROTEIN VERB activate PROCESS(e.g.,Akt activates apoptosis) would be considered a valid pattern,whereas PRO-TEIN VERB attach PROCESS(e.g.,Cbl binds to apopto-sis)would not be.•Paraphrastic patterns.A set of patterns represent an equivalence relation where the patterns are different grammatically but represent the same underlying operator-argument structure.Thus,in the clinical do-main,the patterns BOD YLOCATIONBODYLOCATION verb be SYMPTOM(e.g.,joints were painful),BOD YLOCA-SYMPTOM IN BODYLOCA-TION(e.g.,pain in joints),SYMPTOM BOD YLOCA-TION(e.g.,painful joints),and BODYLOCATION SYMPTOM(e.g.,joint pain)all are equivalent to PER-SON VERB have BOD YLOCATIONSYMPTOM IN BODYLOCATION.Simi-larly,in the biomolecular domain,the patterns protein2verb be verb interaction BY protein1(e.g.,Cbl was activated by Fyn),and protein2noun interaction OF protein1(e.g.,Fyn activation of Cbl),are equivalent to protein1verb interaction protein2(e.g.,Fyn activated Cbl).It is therefore possible to choose one of the pat-terns as representative of the type of informational relationships conveyed by the set.•Omission of information.In a specific domain,when the context is known,additional contextual information is often expected or understood.According to Harris,in-formation that is expected is low in information con-tent and can be omitted because it is recoverable from the context.For example,in a radiological report of the chest,infiltrate noted is interpreted to mean infil-trate in lung was noted by radiologist.Omitted informa-tion is troublesome for language processing because a system must have additional knowledge in order to recover all the implicit information.•Intermingling of sublanguage patterns and general lan-guage.When looking at text of a domain,the sublan-guage patterns are often interspersed with general language that is not in the sublanguage,making the process of identifying sublanguage co-occurrence pat-terns difficult.For example,in he complained of a se-vere headache while working in the family store,and then fainted,the expression while working in the family store may be relevant to the patientÕs condition but is not in the clinical sublanguage because it does not contain the sublanguage entities or relations while the rest of the sentence does.•Terminology.Within specialized domains,words of-ten take on different meanings than in the general world domain,and therefore specialized domain vo-cabularies are needed to process domain text.For ex-ample,in the clinical domain capsule may denote a body location component,and in the biomolecular domain,associates may denote an interaction sense binds in addition to its general English sense.In mul-ti-word terms the issue is more complex because a term may have a meaning that is compositional and therefore denotes the meaning of the components, or that is non-compositional and denotes something different than the combined meaning of the compo-nents.For example,in radiological reports of the chest,no active disease not only means that there is no current disease activity but also denotes that there are signs of a previous condition in the X-ray.Thus, the phrase no active disease takes on a specialized meaning in the domain as if it were a single word.Ev-idence supporting this assumption is the frequent oc-currence of the phrase in radiological reports and the existence of a corresponding abbreviation NAD.The issue concerning compositionality is not a trivial one, and a number of articles have been published in the medical domain that discuss compositionality[39–41]and modification[42].The main issue is whether or not to treat certain multi-word terms,such as chest pain and chronic cough,as single words,or as words with modification.This issue is even more complex in the biomolecular domain because(1)verbs play a more significant role,(2)verbs frequently occur in the nominal form(e.g.,activation,activator),(3)ver-bal relations are often nested,and(4)substances are frequently given complex names that correspond to the functions they perform.Thus,inhibitor of mitogen activation may refer to the name of a particular entity or to a type of entity(i.e.,one that inhibits mitogen).In the case of mitogen-activated protein kinase there is evidence to support considering the phrase as a single name:the phrase is frequently found in the literature, and has a corresponding acronym,MAPK.We believe that more research on this topic is needed.The use of large domain corpora and statistical anal-ysis of frequency distributions may provide us withC.Friedman et al./Journal of Biomedical Informatics35(2002)222–235225objective criteria for determining whether or not a term should be treated as a single unit(called a collo-cation in the computational and statistical literature), or a compositional phrase composed of separate ele-ments.Manning and Sch€u tze[43]provide a discus-sion about linguistic and statistical considerations of collocations,which roughly are phrases that are not straightforward compositions of the parts.•Controlled vocabulary.Controlled vocabularies are usually associated with domain-specific terminology, well-defined concepts,and with a method for linking the terms in the terminology to the concepts.These vocabularies were generally defined based on expert knowledge,and as such are not definable according Harris because they were not established on distribu-tional grounds.A controlled vocabulary is important for computerized applications because it facilitates sharing of information among different systems by making it possible to map the multiple ways of ex-pressing a concept to one well-defined concept in the controlled vocabulary,thereby reducing the vari-ety of expression after the mapping.However the mapping between language and any controlled vo-cabulary is somewhat arbitrary and difficult to justify on objective grounds.In the clinical domain there are several controlled vocabularies,such as the UMLS[44],SNOMED[45],ICD-9[46],and MESH[47],which consist of well-defined or standard concepts corresponding to words and phrases in the domain.In the biomolecular domain there are a number of systems that are associated with terminology and controlled vocabularies,such as the Gene Ontology[48],GenBank[49],SWISS-PROT[50],and Locu-sLink[51].•Ontology.If applications using a controlled vocabu-lary require reasoning,a formal specification of the entities and relations in the domain(i.e.,an ontology) is very useful.An ontology of a domain may overlap with semantic classes associated with a domain sub-language.For example,the clinical domain and bio-molecular domain are likely to have classes corresponding to body location,disease,and medica-tion(classified as small molecules in the biomolecular domain).However,in an ontology,the classes are based on knowledge of the domain and are used to facilitate reasoning.In a sublanguage,classes are used to recognize,constrain,and interpret co-occur-rence patterns.A big difference between the two is that sublanguage patterns are obtained through ob-jective analysis of data,while ontologies are not nec-essarily constructed with direct empirical evidence.An ontology may be useful for natural language pro-cessing applications,but this is not necessarily true because the granularity of the classes may differ.For example,it is generally sufficient for a sublan-guage to have one coarse class called DISEASE(oran even coarser class FINDING)without having a complex hierarchy of disease subclasses because all the members of the class DISEASE generally have the same co-occurrence patterns.However,in an on-tology,it would be preferable to partition the class into subclasses since applications involving reasoning would generally treat the subclasses DISEASE,RESPI-RATORY DISEASE,PULMONARY DISEASEPU LMONARY D ISEASE,and PNEUMON IAPNEUMONIA differently because they are associated with different clinical properties.4.The sublanguage of the clinical domain4.1.BackgroundHarris provided a theoretical basis for sublanguage processing and identified semantic categories and co-occurrence patterns for several scientific languages.Two large scale natural language processing systems,the Linguistic String Project system[36]and the MedLEE system[52],were both based on HarrisÕs theories and were both applied to the clinical domain.However, these systems differ from Harris because the grammars they each use follow a constituent grammar formalism and not an operator-argument formalism.Below we present a brief overview of the sublanguage approaches of the two systems.The Linguistic String Project(LSP),headed by Sager et al.[36],was a pioneering effort in language processing in the general English domain and also in the clinical domain.A detailed version of a computerized English grammar and parsing system is described by Sager[53]. The LSP system was thefirst general comprehensive NLP system in the medical domain that contained a sublanguage grammar based on HarrisÕs sublanguage theory.The LSP system inspired several other systems, including the early version of PROTEUS[29],PUNDIT [54],KERNEL[55],and MedLEE[52].The LSP system established40clinical subclasses that denoted the rele-vant types of clinical information found in patient documents(i.e.,symptom,medication,body part),and 14general English semantic subclasses associated with verbs(e.g.,have,be),temporal information(e.g.,change, increase),evidential information(e.g.,no,present),and connective operators(e.g.,consistent with,and).In ad-dition,six types of semantic relations were established that corresponded to the representation of basic infor-mational relations associated with patient management (transferred to ICU),treatment excluding medication (intubated),medication(penicillin qd),tests and results (ppd positive),patient behavior(drinks excessively),and patient state(pain in joints).In the LSP system,each statement type could be thought of as a frame or template representation that denoted predetermined semantic relations among word226 C.Friedman et al./Journal of Biomedical Informatics35(2002)222–235classes.For example,a statement type corresponding to patient state would be a template with slots for the patient state,temporal information,body location, severity,and evidence where the latter four types of slots represent optional qualifiers of the patient state (e.g.,as in patient experienced severe pain in joints yesterday).Similarly,a medication statement type would have slots for medication,dose,frequency, manner,etc.(e.g.,on ampicillin2mg qid po).Inter-estingly,the six statement types accounted for the majority of the relevant relations in the clinical do-main.MedLEE,which was developed by Friedman et al. [52,56]is also a comprehensive language processing system in the clinical domain that relies on a sublan-guage grammar,which(i)specifies the subclasses in the language,(ii)delineates the structure of the language (i.e.,well-formed sequences of subclasses),(iii)interprets the relations among the elements of the structures,and (iv)specifies a representational form for each structure, which is consistent with a formal representational schema for medical language[52].The schema was de-signed on the basis of experience with the LSP system and a manual analysis of sample reports that were randomly retrieved from the clinical repository of pa-tient reports at Columbia-Presbyterian Medical Center (CPMC).MedLEE currently consists of53semantically rele-vant classes,most of which are shown in Table1,and several syntactic classes that have semantic interpreta-tions such as conjunction(e.g.,and,or),preposition (e.g.,after,in),and certain types of verbs(e.g.,involved, demonstrated).The original schema was designed to representfindings in radiological reports,and was ex-pressed in the format of conceptual graphs,but was later changed to a frame representation in the form of lists because it was more convenient computationally.Ac-cording to the MedLEE schema the main relations in a radiology report consist of primaryfindings with op-tional modifiers(e.g.,moderate left posterior central gyral hypodensity and connective relations between the primaryfindings(e.g.,CT scan revealed a hypodensity consistent with an infarct).In the two examples the primary information is underlined and the connective relations are shown in bold.When expanded to broader domains such as discharge summaries,additional basic relations were added to represent new types of primary clinical events,such as medication,laboratory tests, demographic information,and behavior.Examples of the primary types of information and target forms are illustrated in Table2.4.2.Clinical sublanguageTable1lists the semantic categories in the clinical do-main along with examples.Some of the classes correspond to primary types of information,such as BEHAVIOR, FIN DINGFINDING,and MEDICATION(e.g.,drinks,pain,and as-pirin).Additionally,some of the categories,such as FIN DINGFINDING,have been subdivided into subcategories Table1Semantic categories and subcategories in the clinical domain and examplesPrimarycategorySubcategory ExamplesADT Admitted,transferBehavior User,drinksBodyfunc Breathing,movement Bodymeas Pulse,weightDevice Catheter,atrial electronicpacemakerFindingCfinding Cardiomyopathy,diabetesmellitusDescriptor Patchy,egg shapedOrganism E.coli,StaphylococcusPfinding Enlarged,opacityLabproc Liver function test,SMAC Labtest Sodium,alkaline phosphatase Med Aspirin,ace inhibitorProc Biopsy,collapse therapyExamproc X-ray,spectral dopplerimagingSubstance Cigarettes,illegal substance Technique Underpenetration,expiredfilm Modifier categoryBodyloc Heart,respiratory system Certainty Possible,rule outChange Increased,came down to Degree Slight,extensive amount Diagmaterial Barium,indium131Ethnic Dominican,Hispanic Examphase Arterial phase,blood pool phase Examtype Serial,digital subtraction Family Mother,sisterFrequency Bid,times twoLocative BelowManner Intravenous,continuousinfusionPosition Axial,medial lateral oblique Ptactivity Awake,lying downPtdescr Twin,left handedQuantity Multiple,one halfRace Caucausian,blackReaction Respond,hypersensitiveRegion Left,right,upperService Emergency room,ICUSex Female,maleSpecialist Cardiologist,pathologist Speciality Cardiology,pathology Timeunit Day,morningUnit mg,centimeters squaredRelational operatorConjunction And,or,as well as,with Connective Accompanying,including,consistent withCertainty verbphraseAppeared,cannot be excludedC.Friedman et al./Journal of Biomedical Informatics35(2002)222–235227。