| UK | Germany |
| Home - Books - Science - Biological Sciences - Bioinformatics | Help | |
| 1-20 of 200 1 2 3 4 5 6 7 8 9 10 Next 20 |
click price to see details click image to enlarge click link to go to the store
| 1. Bioinformatics for Dummies by Jean-MichelClaverie, CedricNotredame, Jean-Michel Claverie, Cedric Notredame | |
![]() | list price: $29.99
our price: $19.79 (price subject to change: see help) Asin: 0764516965 Catlog: Book (2003-01-15) Publisher: For Dummies Sales Rank: 11685 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description That's where Bioinformatics For Dummies comes in. If you want to know what bioinformatics is all about and how to use it without wading through pages of computer gibberish or taking a course full of theory, this book has the answers in plain English. You'll find out how to Written by two experts who helped develop the science, Bioinformatics For Dummies is all about getting things done. If you're just getting your feet wet, start at the beginning with a quick review of those necessary parts of microbiology and an overview of the tools available. If you already know what you want to do, you can go directly to a chapter that shows you how. Get the lowdown on With an Internet connection and Bioinformatics For Dummies, you'll discover how to peruse databases that contain virtually everything known about human biology. It's like having access to the world's largest lab, right from your desk. This book is your lab assistant – one that never takes a day off, never argues when you ask it for help, and won't demand a benefits package. Reviews (6)
As a scientist who spends at least half of my time BLASTing, I also read it for accuracy and found it to almost error-free (any errors were in the figures). Additionally, most of the web pages were up-to-date, although as time passes the links will decay and web pages will change their look. In addition, the book contained enough in-depth content to teach me several new tricks of the trade. Further, I believe the book had sufficient background material to educate the novice. To test this, I gave the manual to a material science chemist and he was able to understand the material, at least until he decided it was more than he wanted to know and quit reading. This is a useful text for those who want to know more than an operational definition of bioinformatics and a must for the library of all bioinformatics users.
Even though I have been for a 4-day bioinformatics course (6 months ago), which I thought was pretty good, this book still had so much to offer. Using this book, I was easily able to substitute the proteins of my interest into their examples and generated meaningful hits. The book also covers deeper and more advanced features of BLAST, discusses sequence alignments using several types of algorithm and even has a section on 3D structures. Towards the end of book - it features a section on working with mRNA and building phylogenetics trees - which again are excellent resources for teachers involved in teaching beginners molecular biology. I am a teacher teaching at a Pre-unversity level. The way the book is structured also lends its material to be modified into lesson materials for training students. It is really a great book! Worth every dollar I spent on it!
| |
| 2. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology by Dan Gusfield | |
![]() | list price: $75.00
our price: $58.50 (price subject to change: see help) Asin: 0521585198 Catlog: Book (1997-01-15) Publisher: Cambridge University Press Sales Rank: 39321 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Reviews (8)
No, there isn't any real source code here. That should not be a problem - this book aims above the cut&paste programmer. The book in meant for readers who can not only understand the algorithms, but apply them to unique solutions in unique ways. String matching is far too broad a topic for any one book to cover. The study can include formal language theory, Gibbs sampling and other non-deterministic optimizations, and probability-based techniques like Markov models. The author chose a well bounded region of that huge territory, and covers the region expertly. The reader will soon realize, though, that algorithms from this book work well as pieces of larger computations. The book's chosen limits certainly do not limit its applicability. By the way, don't let the biological orientation put you off. DNA analysis is just one place where string-matching problems occur. The author motivates algorithms with problems in biology, but the techniques are applicable by anyone that analyzes strings.
All of the major exact string algorithms are covered, including Knuth-Morris-Pratt, Boyer-Moore, Aho-Corasick and the focus of the book, suffix trees for the much harder probem of finding all repeated substrings of a given string in linear time. In addition to exact string matching, there are extensive discussions of inexact matching. Even the discussions of widely known topics like dynamic programming for edit distance are insightful; for instance, we find how to easily cut space requirements from quadratic to linear. There is also a short chapter on semi-numerical matching methods, which are also of use in information retrieval applications. Inexact matching is extended to the threshold all-against-all problem, which finds all substrings of a string that match up to a given edit distance threshold. The theoretical development concludes with the much more difficult problem of aligning multiple sequences with ultrametric trees, with applications to phylogenetic alignment for evolutionary trees (an approach that has also been applied to the evolution of natural languages). Note that there is no discussion of statistical string matching. For that, Durbin, Eddy, Krogh and Mitchison's "Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acides" is a good choice, or for those more interested in language than biology, Manning and Schuetze's "Statistical Natural Language Processing". There is also no information on more structured string matching models such as context-free grammars, as are commonly used to analyze RNA folding or natural language syntax. Luckily, Durbin et al. and Manning and Schuetze also provide excellent coverage of these higher-order models in their books. This book is not about efficient implementation. If you need to build these algorithms, you'll also need to know how to write efficient code and tune it for your needs. This is an algorithms book, pure and simple. As a computer scientist, I found the discussions of computational biology to be more enlightening than in other textbooks on similar topics such as Durbin et al., because Gusfield does not assume the reader has any background in cellular biology. Instead, he provides his own clear and gentle introductions illustrated with algorithms, applications, open problems and extensive references. Like most Cambridge University Press books, this one is beautifully typeset and edited.
This is the most complete resource i could find about suffix trees, how to implement them, usages, and algorithms. Actually, when I took this book, I was interested in suffix arrays. Well - this book explains those better than the original paper do. Many applications to suffix trees are listed, along with comparisons to other algorithms applied to those problems. If you need to get into string algorithms from computer science perspective - this is a good book to start. If you want to "feel" of the biologists side of the story, than this is not a good choice. I use this book as a textbook on the subject, and I'm sure I'll be using it as a reference later on. This book surely is worth its cost (even if you buy it on Amazon...:-)).
The author restricts his attention to deterministic approaches to string matching and comparison, and thus there is no treatment of hidden Markov models or Monte Carlo methods. The major algorithms such as the Aho-Corasick, Boyer-Moore, Knuth-Morris-Pratt, Needleman-Winsch, and Smith-Waterman are discussed and brilliantly motivated in the book. The author employs very effective diagrams to illustrate the matching concepts that are detailed in the book. The book does require some time to read but it is worth the effort. Also, the exercises can be challenging but some should he done in order to understand the concepts in the book. The empirical results of the algorithms as sequence databases are also included, with FASTA, BLAST, BLOCKS, BLOSUM, and PROSITE are discussed in detail. The chapter that discusses these is the least mathematical of all the ones in the book and was no doubt included to connect the reader with real-world applications of the techniques in the book. The last quarter of the book is a lot more trendy than the rest, with emphasis placed on algorithms for physical mapping, fragment assembly, and phylogenetic trees. These algorithms of course take on particular importance today given the Human Genome and other gene sequencing projects. Radiation-hybrid mappings, direct sequencing, and shotgun DNA sequencing are discussed in one of the chapters in this section, and the author addresses in great detail some approaches to speeding up sequence assembly. In the discussion on shotgun DNA sequencing the author refrains from any probabilistic analysis, instead referring the reader to the references. This omission goes along with the rest of the book, where probabilistic methods are not used, which is a little disappointing since these have shown great promise in computational biology. The exercises at the end of the chpater are very interesting and it is worth spending time working some of them through. In a later chapter, the solution of the satisfiability problem in mathematical logic is discussed and shown to be solved (at least theoretically) by DNA-based computing. The quantities of DNA needed to carry out the computation are shown to be infeasible by the author. This book will no doubt be of great assistance to those interested in the more rigorous approaches to computational biology. But the best attribute of the book is that one gets the impression that the author had a good time writing it, and that shows through in this very important book. ... Read more | |
| 3. Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, Third Edition | |
![]() | list price: $79.95
our price: $79.95 (price subject to change: see help) Asin: 0471478784 Catlog: Book (2004-10-15) Publisher: Wiley-Interscience Sales Rank: 70570 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description
Reviews (13)
I read the review by "a reader in Cambridge, MA", and don't understand what their beef is with this title. The authors have tried (and have succeeded) in pointing the readers to the best PUBLIC DOMAIN software out there, augmenting documentation that's generally lacking. Have you ever tried finding good docs on the NCBI Web site? Well, these two editors got them for you. UNIX-centric? I can't speak for the first edition, but check out the second edition and see that there's tons of Netscape screen dumps demonstrating the tools and making things as easy as possible for the reader. I originally bought this because of the reviews published in Science and Cell and a slew of other journals, all favorable, so the "reader in Cambridge" seems out of step with all of the published journal reviews of the book. Everyone's entitled to their opinion, but I just wanted to point this out for a sense of balance here, especially since my own experience was so different.
I'd have to agree with the other reviewer that Chapters 1 & 17, which constitute 10% of the book, are wasted paper. No one in 2001 (when the book was published), let alone 2004, needs Chapter 1's lengthy explanation of what e-mail and web browsers are. And the perl program at the anticlimax of Chapter 17 was ... anticlimactic. The book is to a great extent a catalog of available software tools. With the exception of the chapters on multiple alignment and phylogeny, the emphasis is on not on how the tools work but how to operate them -- to the of saying "at this URL there is a web page where you can either paste in your sequence or upload a file". The idea of invoking a program through a Unix command line is more than once presented as a truly daunting prospect. The authors generally do a good job of emphasizing that the programs are the beginning of analysis and not the end; the results must always be viewed somewhat skeptically with an expert eye. If you're coming at the book as a biologist, you will probably find it to be a useful catalog of software, though undoubtedly dated by now. If you're coming at it from the informatics side, you're going to need some background... a book like Dwyer's, Setubal and Meidanis's, or Mount's will get you up to speed on the algorithm aspects of the field with simplified versions of many of the big problems. Then you can look at this book to find good pointers to the ways the real-world versions have been addressed. The book was published three years ago and, being to a large extent an index of the work of others, is necessarily no longer up to date in a fast-moving field. It needs a revision and, in the meantime, it would make more sense to snag a used copy than to pay full price for a new book.
I think this is a good, brief introduction to the wide variety of bioinformatic tools and databases on the internet. It describes the major features of each, and the kinds of results that each tool is good for. After that, the serious user will go to the sources of each tool or database, to learn more about the specifics as of the moment. No book can hope to keep up with the weekly enhancements at the major repositories. I emphasize that this is for tools users, not tool makers. It addresses the working scientists who already know their subjects and their needs. This skips over the algorithms in favor of higher level descriptions, and skips over many of the biological reasons for the tools described. Better-informed tool users get better answers from the tools, true. At some point, though, the biologists want to skip the theory, skip the introduction to subjects in which they're experts, and get on with their science. I don't think this book was ever meant for people - and I'm one - who want full details of the algorithms. I agree, the book treats its many subjects in a shallow way. I think that is by intent, since the book's real goal is breadth and its target is a reader who knows the basic science. It's a bit off the center of my interests, but I've found it helpful.
| |
| 4. Beginning Perl for Bioinformatics by James Tisdall | |
![]() | list price: $39.95
our price: $26.37 (price subject to change: see help) Asin: 0596000804 Catlog: Book (2001-10-15) Publisher: O'Reilly Sales Rank: 20851 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Amazon.com Departing from O'Reilly's earlier monograph Developing Bioinformatic Computer Skills, Tisdall's text is organized aggressively along didactic lines. Nearly all of the 13 chapters begin with twin bullet lists of Perl programming tools and the bioinformatic methods that require them. Likewise, the chapters end with exercises. String concatenation is illustrated with gene splicing, and regular expressions are taught with gene transcription and motif searching. Tisdall emphasizes sequence examples throughout, leading up to an introduction to a Perl interface for the NIH GenBank biological database and the widely used BLAST sequence alignment tool. After a brief discussion of three-dimensional protein structure, he returns to sequence extraction and secondary structure prediction. Tisdall's goal is to boost the beginning programmer into a domain of self-learning. He imparts essential etiquette for the success of programming newbies: use the wealth or resources available, from user documentation to Web site surveys to FAQs to How-To's to news groups and finally to direct personal appeals for help from a senior colleague. A well-plugged-in bioinformatics Perl student will soon discover Bioperl, an open-source effort to bring research-grade bioinformatic tools to the Perl community. Bioperl is described briefly at the end of Tisdall's book and will reportedly be a forthcoming title of its own in the O'Reilly bioinformatics series. Although he introduces bioinformatics as an academic discipline, Tisdall treats it as a trade throughout his book. He indicates that open questions and computational hard problems exist, but does not describe what they are or how they are being tackled. Ultimately, Tisdall presents bioinformatics as another arrow in a bench scientist's quiver, very much like HPLC, 2D-PAGE, and the various spectroscopies. As odd as a "bioinformatics-as-tool" book may be to its research proponents, the reduction of bioinformatics to trade status both deflates and vindicates the years of research, as Tisdall's work attests. --Peter Leopold Reviews (16)
Superficially, this book isn't all that different from a lot of introductory Perl books: the Perl material starts out with an overview of the language, followed by a crash course on installing Perl, writing programs, and running them. From there, it goes on to introduce all the various language constructs, from variables to statements to subroutines, that any programmer is going to have to get comfortable with. Pretty run of the mill so far. Tisdall starts with two interesting assumptions, though: [1] that the reader may have never written a computer program before, and so needs to learn how to engineer a robust application that will do its job efficiently and well, and [2] that the reader wants to know how to write programs that can solve a series of biological problems, specifically in genetics and proteomics. As such, there is at least as much material about the problems that a biologist faces and the places she can go to get the data she needs as there is about the issues that a Perl programmer needs to be aware of. The author introduces the reader to the basics of DNA chemistry, the cellular processes that convert DNA to RNA and then proteins, and a little bit about how and why this is important to the biologist and what sorts of information would help a biologist's research. The main sources of public genetic data are noted, and the often confusing -- and huge -- datafiles that can be obtained from these sources are examined in detail. With the code he presents for solving these problems, Tisdall makes a point of not falling into the indecipherable-Perl trap: this is a useful language, well-suited to the essentially text-analysis problems that bioinformatics means, and he doesn't want to encourage the kind of dense, obscure, idiomatic coding style that has given Perl an undeservedly bad reputation. Some of Perl's more esoteric constructs are useful, and they show up when they're needed, but they're left out when they would only serve to confuse the reader. This is a good decision. Rather, the focus is on teaching readers how to solve biological problems with a carefully developed library of code that happens to leverage some of Perl's most useful properties. The result is pretty much a biologist's edition of Christiansen & Torkington's Perl Cookbook or Dave Cross' Data Munging With Perl. The author presents a series of issues that a working bioinformaticist might have to deal with daily -- parsing over BLAST, GenBank, and PDB files, finding relevant motifs in that parsed data, and preparing reports about all of it. If a bioinformaticist's job is to be able to report on interesting patterns from these various sources, then following the programming techniques that Tisdall explains in clear, easy-to-follow prose would be an excellent way to go about doing it. And when I say "programming techniques," note that I'm not specifically mentioning Perl. The code in this book is clear and organized, and all programs are carefully decomposed into logical subroutines that are then packaged up into a library file that each later sample program gets to draw from. Each new program typically contains a main section of a dozen lines of code or less, followed by no more than two or three new subroutines, along with calls to routines written earlier and called from the BeginPerlBioinfo.pm that is built up as the book progresses. Each sample is typically preceded by a description of what it's trying to accomplish and followed by a detaild description of how it was done, as well as suggestions of other ways that might have worked or not worked. This modular approach is fantastic -- too many Perl books seem to focus so heavily on the mechanics of getting short scripts to work that they lose sight of how to build up a suite of useful methods and, from those methods, to develop ever-more-sophisticated applications. It isn't quite object-oriented programming, but that's clearly where Tisdall is headed with these samples, and given a few more chapters he probably would have started formally wrapping some of this code into OO packages. If I have a complaint with the book, in fact, it's that Tisdall doesn't go any further: everything is good, but it ends too soon. Seemingly important topics such as OO programming, XML, graphics (charts & GUIs), CGI, and DBI are mentioned only in passing, under "further topics" in the last chapter. I also have a feeling that some of the biology was shorted, and the book barely touches upon the statistical analysis that probably is a critical aspect of the advanced bioinformaticist's toolbox. I can understand wanting to keep the length of a beginner's book relatively short, and this was probably the right decision, but it would have been nice to see some of the earlier sample problems revisited in these new contexts by, for example, formally making an OO library, showing a sample program that provided a web interface to some of the methods already written, or presenting code that presented results as XML or exchanged them with a database. But these are minor quibbles, and if the reader is comfortable with the material up to this point, she shouldn't have a hard time figuring out how to go a step further and do these things alone. It's a solid book, and one that should be able to get people learning Perl, genetics, or both up to speed and working on real world problems quickly.
Also, I do not like the fact that it uses "quick and dirty" Perl (no "use strict" pragma). While it might be less confusing to skip it at the very beginning, very soon students start to waste too much precious class time trying to locate bugs that would make the program not compile with "use strict" in the first place (e.g. mistyped variable names).
The consensus in the field seems to be that it's more productive (and certainly easier) to teach biologists how to program, rather than try to get programmers up to speed on the intracities of molecular biology. For similar reasons, Perl is a popular language to learn: it's easy to get off the ground and be productive with it, without requiring a heavy computer science background. (This, of course, has downsides as well...) Never one to miss out on a trend, I'm going to be teaching a course on Bioperl and advanced Perl programming, starting next fall, which means I'm doing a lot of reading in this topic area, trying to develop lectures and find good background reading material. One of the first books I grabbed was _Beginning Perl for Bioinformatics_, which has been sitting on my "to read" shelf since O'Reilly sent me a review copy in December of 2001. It's a typical O'Reilly "animal" book (the cover bears three tadpoles), which does a decent job of introducing the basic features of the Perl language, and it should enable a dedicated student to get to the point where she can produce small useful programs. However, I'm not completely happy about the book's organization, and I think the occasional "if you're not a biologist, here's some background" interjections could have been cut without hurting anything. The initial chapters in the book cover "meta" information, such as theoretical limits to computation, installing (or finding) the Perl interpreter on your computer, picking a text editor, and locating on-line documentation. Some general programming theory stuff is covered as well -- the code-run-debug cycle, top-down versus bottom-up design, the use of pseudocode. There's also some biology background, but it's very introductory level stuff -- DNA has four bases, proteins are made of 20 amino acids, and so on. In chapter four, the book begins to get into actual Perl, with some coverage of string manipulation. Examples deal with simulating the transcription of DNA into RNA. Chapters five and six continue to flesh out the language, covering loops, basic file I/O, and subroutines. Chapter seven introduces the rand() function, in the context of simulating mutations in DNA. Subsequent chapters introduce the hash data type (using a RNA->protein translation simulation), regular expressions (as a way to store the recognition patterns of restriction endonucleases), and parsing database flat files and BLAST program output. I'm clearly out of the target audience of the book, as I already have a strong working knowledge of Perl. Perhaps that's why I found the order that concepts were presented in to be a bit strange -- for example, hashes, which are a fundamental data type, aren't introduced until halfway through the book, and regular expressions (one of the key features of Perl) first appear even later. As I said above, I also found the biological background sections to be more distracting than anything, but I've also got a strong biology background, so perhaps I'm off base here too. That said, I think a person with a CS background would be better served with a copy of _Learning Perl_ and an introductory molecular biology text than with this particular book. One of the things I did enjoy about the book were the frequent coding examples, all of which presented realistic computational biology sorts of problems and then demonstrated how to solve them. I'm sure that when I get around to writing lectures, I'll be leafing through this book looking for problems I can use in class. Overall, recommended for biologists without programming experience who would like to get started using Perl for simple programming. Not recommended for people with computer science backgrounds looking to get into bioinformatics.
| |
| 5. Bioinformatics: Sequence and Genome Analysis by David W. Mount | |
![]() | list price: $75.00
our price: $75.00 (price subject to change: see help) Asin: 0879696087 Catlog: Book (2001-03-15) Publisher: Cold Spring Harbor Laboratory Press Sales Rank: 213515 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description It is written for any biologist who wants to understand methods of sequence and structure analysis and how the necessary computer programs work Sequence alignment, structure prediction, phylogenetic and gene prediction, database searching, and genome analysis are clearly explained and amply illustrated Underlying algorithms and assumptions are clearly explained for the non-specialist Examples are presented in simple numerical terms rather than complex formulas and notation Theoretical underpinnings are linked to biological problems and their solutions Extensive tables provide descriptions and Web sources for a broad range of publicly available software Based on the author's extensive experience as a molecular geneticist and bioinformaticist at the University of Arizona, this is a uniquely educational book, ideal as a laboratory reference for investigators and also as teaching reference for graduate and undergraduate students studying this fast-changing discipline. Reviews (14)
I purchased this book a while ago. At that time, the book was really difficult to read. I thought that it is because I do not have enough knowledge to understand the material. So I stopped reading this book and studied bioinformatics by other means. After gaining enough knowledge in bioinformatics, I re-opened this book, and it is funny to find that I still have the same amount of difficulty in understanding what the author wrote about topics that I have already built good understanding. Reading this book will only deteriorate one's understanding. Several years ago, only just a few books were available on the market, so one needed to purchase this book. These days, there are lots of varieties to choose, and any choice is likely to be better than this book.
This book has a good coverage of FASTA and The programming techniques coverd are bare. Though I am yet to find a good book that deals only with On the whole this book helped me understand a lot If you are reading this review pls understand that I am Hope this helps Santy
So far, the best there is for a survey course - but for depth and accuracy in sequence analysis algorithms, go to Durbin et al or Gussfield. ... Read more | |
| 6. Statistical Methods in Bioinformatics by Warren J. Ewens, Gregory R. Grant | |
![]() | list price: $89.95
our price: $89.95 (price subject to change: see help) Asin: 0387952292 Catlog: Book (2001-04-20) Publisher: Springer-Verlag Sales Rank: 216007 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Warren Ewens is Professor of Biology at the University of Pennsylvania. He is the author of two books, Population Genetics and Mathematical Population Genetics, and has served on the editorial boards of Theoretical Population Biology, GENETICS, Proceeding of the Royal Society B and SIAM Journal in Mathematical Biology. He was recently awarded the Gold Medal of the Australian Statistical Society and elected as Fellow of the Royal Society. His research interests are in evolutionary population genetics, linkage analysis for human diseases, and bioinformatics. Gregory Grant is a bioinformatics researcher at the University of Pennsylvania in the Computational Biology and Informatics Laboratory (CBIL), where he has been since 1998. In 1995 he received a Ph.D. in Mathematics from the University of Maryland and in 1999 a Masters in Computer Science from the University of Pennsylvania. His research interests are in bioinformatics in general and in particular in the statistical analysis of gene expression data and significance testing methods for IBD-mapping. Reviews (5)
A topic such as the two-sample t-statistic is scattered throughout the book, with the main part not even cited in the index! Unfortunately there are not a lot of books in the field of Statistics in Bioinformatics. However, I would recommend "The Elements of Statistical Learning" (Hastie et al.) for classifiers etc (Duda and Hart's classic is also good). I would recommend "Biostatistical Analysis" by Zar for a general coverage, and Terry Speed's "stat Labs: Mathematical Statistics ..." which is not comprehensive but has good lab examples with associated statistical analysis.
Chapter one begins, appropriately, with an introduction to probability theory, with a consideration of discrete probability distributions of one variable beginning the chapter. The Bernoulli, binomial, uniform, geometric, generalized geometric, and Poisson distributions are discussed. The authors point out the use of geometric-like distributions in the BLAST application. The also caution the reader as to the difference between the mean and the average of a random variable. They then move on to consider continuous distributions, discussing briefly the uniform, Normal, exponential, gamma, and beta distributions. Moment-generating functions are also introduced, and they prove a "convexity" theorem for these functions that is important in the BLAST application. The authors also introduce the relative entropy and generalized support statistics, the later also being used in BLAST. The next chapter is an overview of probability theory in many random variables. The results in chapter one are discussed in this context, and the authors give an interesting application to the sequencing of EST libraries. The authors also point out that the variance of the maximum of a collection random variables is finite as the number of variables increases, a fact that is used quite often in bioinformatics. Transformations of random variables are also discussed, with the goal of showing how these can be used to find the density function of a single random variable, this also being important in BLAST. The most important subject of the book begins in chapter 3, wherein the authors introduce statistical inference. They begin with a very brief discussion of the differences between the frequentist and Bayesian approaches to statistical inference and then move on to classical hypothesis testing and nonparametric tests. This chapter is of great value to those readers, for example biologists/would-be bioinformaticists who are approaching statistics for the first time. Chapter 4 introduces concepts that are of upmost importance in probabilistic computational biology, namely Markov chains. The discussion in this chapter sets up the strategies used in the next chapter on analyzing a single DNA sequence and a latter chapter on hidden Markov models. Shotgun sequencing is discussed as a tool to determine the an actual DNA sequence, and the authors discuss the probabilistic issues that arise in the reconstruction of long DNA sequences from shorter sequences. Missing in this chapter is a mathematical analysis of the advantages/disadvantages between shotgun and whole genome sequencing strategies. Chapter 6 then generalizes the analysis of chapter 5 to multiple DNA and protein sequences. It is here that one begins to talk about alignments between sequences, which bring about some very subtle mathematical problems in computational biology. The computational complexity of the (global) alignment problem entails the use of softer techniques, such as dynamic programming, which is discussed in this chapter. The (local) alignment problem is also discussed in some detail, using the linear gap model. The alignment problem and the issues with scoring for protein sequences are also discussed in detail. The reader first encounters the famous PAM and BLOSUM matrices in this chapter. The authors do not discuss any connections with the protein folding problem, unfortunately. The next chapter introduces the basic probability theory behind the BLAST algorithm, namely random walks. They do so with emphasis on moment generating functions, which might be a little abstract for the biologist reader. The authors return to tatistical estimation and hypothesis testing in chapter 8, with maximum liklihood and fixed sample size tests discussed in some detail. Again connecting with the BLAST algorithm, the sequential probability ratio test is treated. The authors finally get down to the BLAST algorithm in chapter 9, using an older version of the software (1.4). The connection of the algorithm with random walks and how to assign scores is immediately apparent, as is the ability of BLAST to do database queries against a chosen sequence. The algorithm is compared with the sequential analysis discussed in the last chapter. The authors return to Markov chains in chapter 10, and give some numerical examples. In addition, they treat the important topic of Markov chain Monte Carlo via the Hastings-Metropolis algorithm, Gibbs sampling, and simulated annealing. An application of simulated annealing to the double digest problem is described. The authors also spend a litte time discussing continuous-time Markov chains. Hidden Markov models are finally discussed in chapter 11. These have been the most effective tools in sequence analysis and the authors give a nice overview of their construction and properties in this chapter. The Pfam package is discussed as a software implementation of HMMs for determining protein domains. Unfortunately, they do not discuss the excellent package HMMER for implementing HMMs in sequence analysis. Chapter 12 discusses computationally intensive methods in classical inference. One of these methods, the bootstrap procedure, which is used for large sample sizes, is described. Used to estimate confidence intervals in situations where there is not enough information to employ classical methods, the authors detail a method using quantiles to estimate the confidence interval for the standard deviation of the expression intensity of a gene. This is followed by a return to the multiple testing problem of chapter 3 in the context of the data analysis of expression arrays. I did not read the last two chapters on evolutionary models and phylogenetic tree estimation so I will omit their review.
This book is the first exception I know of. It builds, and rests on, solid foundations of genetic stochastic processes and still goes all the way to real-life problems. Let me illustrate this by means of an example, rather than enumerating all the topics in the book. Chap. 14, entitled `phylogenetic tree estimation' (as opposed to the more common term `phylogenetic tree reconstruction' - not without reason, I presume) builds on, and is firmly interlaced with, Chap. 13 about `evolutionary models', which systematizes the zoo (if not jungle) of substitution models in both discrete and continuous time. On this basis, the overview of tree-building methods makes a lot of sense. Even better, it does not stop here, but presents an application (to real sequence data), followed by a careful analysis of where the various methods agree, and where - and maybe why - they disagree. This way, it clears away some common misconceptions; in particular, it presents a careful analysis of what bootstrap does and what it does not in this context. The chapter closes with a discussion of unresolved problems (like inhomogeneity of substitution rates), and methods and possible pitfalls related to testing of nested and non-nested hypotheses in tree estimation. The book is written in an informal style without being imprecise, which makes it pleasant reading. It is particularly suitable for teaching at a high level. This is enhanced by realistic (and even real-life) examples that furnish the text, as well as carefully chosen exercises at the end of each chapter. Certainly, this first edition of `Statistical Methods in Bioinformatics' cannot be the last word in this fast-moving field. But it is an excellent guide into the `right' direction.
The authors appear not to have much personal experience with sequence analysis and their exposition seems to be dominated by suggestions from not very honest or objective colleagues. At least that much can be inferred from the list of references given at the end of the book and the content of sequence-analysis-oriented chapters 5, 6 and section 11.3 of chapter 11. On the other hand, chapters 9 (about BLAST statistics), 13 (about evolutionary models), and 14 (about phylogenetic trees) are excellent. Every practicing bioinformatician should read them as a required reading before doing anything with BLAST or with construction of evolutionary trees. Chapter 12 about computationally intensive methods is also very well written. However, the authors fail to notify the reader that many of the methods (such as bootstrap) have a really bad reputation among researchers involved in sequence analysis. Perhaps at least one sentence of warning (with references) could be in order. In summary: The book is a mixed blessing but I would recommend it to statisticians who desire to do some work in bioinformatics. I also believe that chapters 9, 13 and 14 should be read by all practicing bioinformaticians.
Topics include basic probability and statistical inference, Poisson processes and Markov chains, DNA sequencing, hidden Markov models, computer intensive methods, evolutionary models and phylogenetic tree estimation. Of particular interest to me is the material on permutation methods and the bootstrap. The bootstrap has been applied in phylogenetics and there has been some controversy about its application there. The authors cover this in Chapter 14 where they appear to have a resolution for the controversy. Permutation tests are first discussed in Chapter 3 "A Introduction to Statistical Inferrence" and are compared with other computer intensive methods in Chapter 12. In Section 12.3 they discuss the Behrens-Fisher problem pointing out why permutation tests are not possible due to the unequal variances. They give the bootstrap t solution. Section 12.2.2 gives a brief, but nicely described, account of bootstrap estimation and confidence intervals and provides a number of references including the following books: Efron and Tibshirani (1993), Davison and Hinkley (1997), Efron (1982), Hall (1992), Manly (1997), Sprent (1998) and Chernick (1999). Bootstrap and permutation approaches to multiple testing are covered in Section 12.4. ... Read more | |
| 7. Bioinformatics and Functional Genomics by Jonathan Pevsner | |
![]() | list price: $94.50
our price: $85.05 (price subject to change: see help) Asin: 0471210048 Catlog: Book (2003-10-31) Publisher: Wiley-Liss Sales Rank: 357602 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Reviews (3)
Now, for the book itself. It is easy to read and covers all aspects of bioinformatics from a sequence perspective (information retrieval, BLAST, gene expression and microarrays, proteomics and protein bioinformatics, genomes and disease). The coverage of databases and URLs is thourough and the text is easy to read, yet useful. The book is comprehensive with one area seemingly missing -- it would have been useful to include a chapter on systems biology and/or cellular modeling and the tools available (i.e. E-Cell). The book is especially useful to a researcher who is trying to explore all aspects of a particular gene, protein, disease, or pathway using bioinformatics tools. The book is in stark contrast to the other Pevser (that is Pevzner) who wrote a bioinformatics book that surveyed algorithm theory underlying bioinformatics. This book is also useful for less technical professionals in industry -- the managers, lawyers and venture capitalists that pervade the biotech landscape all need to communicate effectively and they can surely learn that here, provided they have some background in cell biology first.
| |
| 8. Discovering Genomics, Proteomics, and Bioinformatics by A. Malcolm Campbell, Laurie J. Heyer | |
![]() | list price: $81.00
our price: $81.00 (price subject to change: see help) Asin: 0805347224 Catlog: Book (2002-09-13) Publisher: Benjamin Cummings Sales Rank: 77077 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Reviews (1)
Recommended to students: yes, together with classic works like Brown. Recommended to Central Library: yes. 1. The supplied CD-ROM is a nice teaching aid. Yet, it is difficult to "extract" pictures from it for teaching purposes. It would be much more useful if the pictures were individually supplied in standard high-quality graphic formats like TIFF, instead of PDF. The later is perfect for distributing text with pictures, but not to retrieve such pictures. Other publishers distribute the book artwork as individual TIFF files. That approach greatly enhances the book and boost sales. This is particularly useful for teachers. Actually, it is a must for us these days. Please, make sure that future versions of the CD-ROM or DVD-ROM are --as this one-- compatible with the open-source Unix-based Mac OS X platform. Thanks. 2. The associated web page "Instructor's Guide" 4. Math minutes are an excellent idea. 5. Boxes are welcome. Please, include more. 6. Also helpful are the boldface words on each chapter. Perhaps they could be also included in a keywords at the beginning of each chapter. 7. The index should be more comprehensive and should have all main entries in boldface. This is important to any index and very few books have it right. 8. The glossary is helpful. It should be more comprehensive, 9. The summaries and conclusions are great, yet should be expanded to include more relevant information. They should be like a "minichapter" an the end of each chapter or --better-- at the beginning. All partial summaries could be pooled into a larger summary that way. 10. Addendum sections could be included as separate notes or boxes. 11. The pronunciation tips for new words are also an excellent idea; mostly for non-English speakers. 12. The classified references are really useful. Well done. If they were commented or "annotated" they would be just perfect. 13. A list of abbreviations would be welcome. A list sorted by the full name would be very handy as well. 14. What about telomerase and aging? What about the fact that 15. It should be clearly indicated the organisms with genomes made of dsDNA, ssDNA, dsRNA and ssRNA. 16. Missing bioinformatics tools and step-by-step analysis of genes and mRNA (see next) and whole genomes. 17. It would be really helpful to explain clearly and analyze --even from a bioinformatics point of view-- the structure of genes, mRNA, CDS, introns, exons, promoters and terminators. It is not clear where do these elements start or end or how to recognize them. Diagrams and graphs would greatly help to explain these absolutely basic and fundamental concepts. In other words, imagine that you have cloned and sequenced a genomic gene as well as a full mRNA (cDNA). Now you want to publish your results and for that you do a comprehensive description of your gene (chromosome) and cDNA (mRNA). That is precisely the kind of information that is missing as a diagram and explanation. In this way, it should be indicated that you may encounter several ATG (or other) starting coding triplets in the mRNA, that if the 20 or so amino acid residues of the 5'-end of a peptide have a high percentage of hydrophobic residues, they are likely part of a leading peptide which would be further excised, that you may encounter several polyadenylation signals, etc. On the genome side, the promoter and terminator structures should be analyzed, as well as the intron-exon boundaries. 18. Likewise, it should be indicated the tools and current 19. Does not mention Lasergene package of DNAStar 21. Differential display methodologies are missing (as well as other methodologies of gene expression like subtractive hybridization). 22. Large-scale sequencing is missing. For instance, sequencing of single-molecules will allow the sequencing of whole chromosomes or genomes. 23. Missing tables comparing different genomes with full details 24. Reference to manufacturers is very useful. Please, include also links to web sites. Best if all manufacturers are included as an appendix. 25. All web sites (NCBI, etc) and web-based applications (BLAST, ORF Finder, etc) should be clearly indicted in an appendix. 26. It is not indicated that the PCR was in fact described with full details by Khorana et al 14 years before Mullis et al. 27. Please, include more drawings and pictures in the printed book and CD-ROM. 28. Suggestion: including chapters on eukaryotic-genomic DNA 29. Suggestion: including chapters on plant and animal transformation. 30. Suggestion: including drawing of Maxam-Gilbert sequencing method and Sanger method (Applied Biosystems electropherograms, 31. Prions, viroids and viruses could be also included. 32. A title index at the beginning of each chapter would be very 33. Bioinformatics could be significantly expanded. 34. QuickTime videos explaining some topics would be fantastic. 35. All in all, a great novel approach. Keep up the great work!
... Read more | |
| 9. An Introduction to Bioinformatics Algorithms (Computational Molecular Biology) by Neil C. Jones, Pavel A. Pevzner | |
![]() | list price: $55.00
our price: $44.00 (price subject to change: see help) Asin: 0262101068 Catlog: Book (2004-08-01) Publisher: Bradford Books Sales Rank: 34569 US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description | |
| 10. Mastering Perl for Bioinformatics by James D. Tisdall | |
![]() | list price: $39.95
our price: $26.37 (price subject to change: see help) Asin: 0596003072 Catlog: Book (2003-06) Publisher: O'Reilly Sales Rank: 67262 Average Customer Review: US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Reviews (2)
Naturally, this book picks up where the earlier one left off, diving headfirst into the details of Perl modules. Chapter two is a quick pass over some basic data structures, with discussion of how you'd implement each in Perl. Subsequent chapters cover object-oriented programming in Perl, using Perl with relational databases, using Perl with web services, generating graphics on the fly with Perl, and the use of the Bioperl suite of libraries. As might be expected, all the coding examples in the book are drawn from reasonably realistic bioinformatics situations. There's a little bit less hand-holding on the biological side in this book, relative to the earlier volume -- which I think is a good idea, as it gives more space to focus on the programming material. The one weakness of this book is that it covers quite a few topics, which means that it doesn't really go into great depth on any of them. The "survey" approach is well done, and it's very nice to have biologically relevant examples and exercises for the breath of material that is addressed, but I think the book might have been stronger if it forewent the "Perl and the Web" and "Perl and Graphics" chapters in favor of more focus on the Bioperl libraries. If you're a bioinformatics programmer who enjoyed "Beginning Perl for Bioinformatics", and you want to get a better idea of what more advanced Perl programming looks like and what sorts of things you can do with Perl, this book is a nice place to start. However, if you're looking for more specific information, other more focused books might be a better choice, if you can live without the biologically focused code examples.
You will learn the OOP aspects of Perl in the context of biological problems. For example, this book explains the OOP concepts by developing a Gene class that stores information about a gene. In other books such as Conway's "Object oriented Perl", CD::Music class is given as an example, which I found very boring. The leap from procedural Perl scripting to object-oriented Perl programming is nontrivial to learn, and I'm sure that this book helps. Bioperl is the subject of chapter 9 (about 40 pages), and it could have been better if there were more thorough treatment about this module. If you are new to Perl, read first "Beginning Perl for Bioinformatics", then this book, because this book assumes that you already have the level of Perl knowledge that can be acquired by that previous book. ... Read more | |
| 11. Statistical Methods In Bioinformatics: An Introduction (Statistics for Biology and Health) by W. J. Ewens, GREGORY, R. GRANT, Warren J. Ewens, Gregory Grant | |
![]() | list price: $89.95
our price: $89.95 (price subject to change: see help) Asin: 0387400826 Catlog: Book (2005-02-28) Publisher: Springer-Verlag Sales Rank: 552935 US | Canada | United Kingdom | Germany | France | Japan |
|
Book Description Advances in computers and biotechnology have had a profound impact on biomedical research, and as a result complex data sets can now be generated to address extremely complex biological questions. Correspondingly, advances in the statistical methods necessary to analyze such data are following closely behind the advances in data generation methods. The statistical methods required by bioinformatics present many new and difficult problems for the research community. This book provides an introduction to some of these new methods.The main biological topics treated include sequence analysis, BLAST, microarray analysis, gene finding, and the analysis of evolutionary processes.The main statistical techniques covered include hypothesis testing and estimation, Poisson processes, Markov models and Hidden Markov models, and multiple testing methods. The second edition features new chapters on microarray analysis and on statistical inference, including a discussion of ANOVA, and discussions of the statistical theory of motifs and methods based on the hypergeometric distribution. Much material has been clarified and reorganized. The book is written so as to appeal to biologists and computer scientists who wish to know more about the statistical methods of the field, as well as to trained statisticians who wish to become involved with bioinformatics. The earlier chapters introduce the concepts of probability and statistics at an elementary level, but with an emphasis on material relevant to later chapters and often not covered in standard introductory texts. Later chapters should be immediately accessible to the trained statistician. Sufficient mathematical background consists of introductory courses in calculus and linear algebra. The basic biological concepts that are used are explained, or can be understood from the context, and standard mathematical concepts are summarized in an Appendix. Problems are provided at the end of each chapter allowing the reader to develop aspects of the theory outlined in the main text. Warren J. Ewens holds the Christopher H. Brown Distinguished Professorship at the University of Pennsylvania. He is the author of two books, Population Genetics and Mathematical Population Genetics. He is a senior editor of Annals of Human Genetics and has served on the editorial boards of Theoretical Population Biology, GENETICS, Proceedings of the Royal Society B and SIAM Journal in Mathematical Biology. He is a fellow of the Royal Society and the Australian Academy of Science. Gregory R. Grant is a senior bioinformatics researcher in the University of Pennsylvania Computational Biology and Informatics Laboratory. He obtained his Ph.D. in number theory from the University of Maryland in 1995 and his Masters in Computer Science from the University of Pennsylvania in 1999. Comments on the First Edition. "This book would be an ideal text for a postgraduate courseâ¦[and] is equally well suited to individual studyâ¦. I would recommend the book highly" (Biometrics). "Ewens and Grant have given us a very welcome introduction to what is behind those pretty [graphical user] interfaces" (Naturwissenschaften.). "The authors do an excellent job of presenting the essence of the material without getting bogged down in mathematical details" (Journal. American Staistical. Association). "The authors have restructured classical material to a great extent and the new organization of the different topics is one of the outstanding services of the book" (Metrika). | |
| 12. Structural Bioinformatics (Methods of Biochemical Analysis, V. 44) | |
![]() | |