Dr. Mark Gardener


GO... 

Community EcologyAnalytical Methods Using R and Excelby: Mark GardenerAvailable now from Pelagic Publishing. On this page find a table of contents and outline. There is also a list of keywords at the end (this is comprehensive but not complete). See the custom R commands I worte during production. Some of the commands are already illustrated on my Writer's Bloc page. See also the Support Files page for data, spreadsheets and RData. See the News page for details about updated files, new custom commands and so on. What you will learn in this book  How this book is arranged  Chapterbychapter Outline 

Back to top 
Community Ecology: Table of ContentsThe book is split into 14 chapters, the first few are designed to get you up and running by helping you plan your approach and to become familiar with the software tools you'll be using. The later chapters delve into various major topics of community ecology. The appendix contains answers to the endofchapter exercises and a list of custom R commands created during the writing of this book. Click on a heading to go to a more detailed description:


Back to top 
What you will learn in this bookThis book is intended to give you some insights into some of the analytical methods employed by ecologists in the study of communities. The book is not intended to be a mathematical or theoretical treatise but inevitably there is some maths! I've tried to keep this in the background and to focus on how to undertake the appropriate analysis at the right time. There are many published works concerning ecological theory; this book is intended to support them by providing a framework for learning how to analyse your data. The book does not cover every aspect of community ecology. There are a few minor omissions – I hope to cover some of these in later works. 

Back to top 
How this book is arrangedThere are four main strands to scientific study; planning, recording, analysis and reporting. The first few chapters deal with the planning and recording aspects of study. You will see how to use the main software tools, Excel and R, to help you arrange and begin to make sense of your data. Later chapters deal more explicitly with the grand themes of community ecology, which are:
The reporting element is not covered explicitly, however the presentation of results is shown throughout the book. A more dedicated coverage of statistical and scientific reporting can be found in my previous work, Statistics for Ecologists Using R and Excel. Each chapter ends with a summary and several selfassessment questions – the answers are in the appendix. The appendix also contains a list of the custom R commands made during the production of this book. These commands are available as part of the CERE.RData file. Throughout the book you will see example exercises that are intended for you to try out. In fact they are expressly aimed at helping you on a practical level – reading how to do something is fine but you need to do it for yourself to learn it properly. The Have a Go exercises are hard to miss. 

Book Outline 

Back to top  1 Starting to look at communities
1.1 A scientific approach This short chapter sets the scene and aims to get you thinking in a scientific manner. Section 1.2 provides a brief overview of the topics covered in the rest of the book. 

Back to top  2 Software tools for community ecology 2.1 Excel This chapter is also pretty short and is aimed at helping you to get the software tools you'll need. In particular there is a section on how to download and install the R program. In general you'll be shown how to manage and manipulate your data using Excel, with the majority of the analyses being carried out using R. Where it is feasible to use Excel for an analysis then you'll be shown how to do it using Excel and R. However, as analyses become more complicated you'll see R used more exclusively. 

Back to top 
3.1 Biological data The more complicated your data are the more important it is that you arrange your data in a "set" manner. This short chapter provides an introduction to the notion of "Biological Recording", simply a way to write your data in a flexible format that allows you to access and manage it easily. 

Back to top  4 Beginning data exploration  Using software tools 4.1 Beginning to use R This chapter is where you'll start becoming familiar with the software tools that you'll be using most often, your spreadsheet (probably Excel) and R, the statistical programming environment. Section 4.1 will get you started using R, so if you've never used R before then this will be worth working through. Section 4.2 gives you some help using Excel, including the use of Pivot Tables and lookup tables as well as more familiar things like sorting and filtering. The final section shows you how to transfer data from Excel into R. 

Back to top  5 Exploring data – Choosing your analytical method 5.1 Categories of study This chapter provides an overview of the analytical methods that you might use in exploration of ecological community data. There is also mention of a few approaches that are not so useful! The idea is to provide you with a sense of the analytical approaches that you can undertake so that you can plan your studies more effectively. 

Back to top  6 Exploring data – Getting insights 6.1 Error Checking This chapter is concerned with managing your data and starting to make sense of it. The first section is about error checking, an oft neglected aspect of data analysis! You may spot some errors in the datasets used as examples in this book. I've deliberately left some of these in place to keep you "on your toes". Section 6.2 is about adding extra information; this can be particularly useful to help group and regroup your data for later analysis. The final section is about looking at your data without doing any "real" analysis. This means getting averages for groups and showing your data in graphical form. Most of this can be done in Excel and this is illustrated using Pivot Tables and chart tools. The latter part of section 6.3 shows you how to do similar things in R – if you are not familiar with R then you'll find this material especially helpful. 

Back to top  7 Diversity – Species Richness 7.1 Comparing species richness Diversity is one of the grand themes of community ecology covered by this book. In fact it is such a large topic that it is split into five chapters. This chapter is about species richness; put another way it is about what you can do when you only have presenceabsence data. The last section covers species and sampling area, this includes rarefaction and estimating species richness using nonlinear modelling. 

Back to top 
8.1 Simpson’s Index This is about indices of diversity, that is, ways of taking species relative abundance into account as well as the number of species in a sample. The most commonly used indices are Simpson's and Shannon, so these are highlighted with their own sections. In section 8.3 you'll see other indices of diversity such as Fisher's alpha, BergerParker dominance and two entropies, Rényi and Tsallis. The chapter focusses on calculating the index values but you'll also find out about evenness and the idea of "effective species", the latter being what is also termed "true diversity". 

Back to top 
9.1 Graphical comparison of diversity profiles This chapter shows you a variety of ways to set about comparing diversity between samples in a meaningful manner. There are two main approaches aside from a purely graphical approach. Section 9.2 explores versions of the ttest that have been developed for use with Simpson's and Shannon indices. These are popular so I've devoted quite a bit of space to them. In sections 9.4 and 9.5 you'll see methods for using bootstrapping to assess differences between samples. Bootstrapping is a way of randomising samples and is becoming increasingly common as a technique, due no doubt to the increasing sophistication of computer software. 

Back to top 
10.1 Calculating beta diversity Diversity can be measured at several scales and this chapter is mostly about beta diversity – that is, changes in diversity from sample to sample. Section 10.1 focusses on simply determining beta diversity between samples as well as dome methods for visualising the results (as dendrograms or ternary plots). Sections 10.2 to 10.5 are concerned with methods for exploring changes in diversity between samples and getting an idea of the significance of these changes. Section 10.6 looks at beta diversity from another aspect, that of overlap (or similarity). In section 10.7 you'll see how you can calculate beta diversity using various metrics of dissimilarity. The final section looks at one way to explore beta diversity in relation to other variables – this involves use of Mantel tests. 

Back to top  11 Rank abundance or dominance models 11.1 Dominance models One way to look at diversity is to rank species in order of their abundance and then plot the results on a graph, usually with a log scale. The shape of the graph can be modelled and many attempts have been made to link ecological theory to the shape of these models. This chapter is about these rank abundance models (or dominance models). In section 11.1 you'll see the main models used and you'll be able to see the "best" model for your samples. Sections 11.2 and 11.3 focus on two of the lognormal models, Fisher's logseries and its "successor", Preston's lognormal. 

Back to top  12 Similarity and Cluster Analysis 12.1 Similarity and Dissimilarity Similarity and cluster analysis is one of the grand themes of community ecology. In section 12.1 you'll see how to produce indices of similarity (and conversely dissimilarity) for samples using presenceabsence or abundance data. Following on from this, section 12.2 covers cluster analysis, where samples are grouped together. This allows you to visualise your samples based on their composition (i.e. their dissimilarity or similarity to one another). There are two main approaches to clustering, hierarchical and partitioning. Both are demonstrated using a range of techniques. 

Back to top  13 Association analysis: Identifying Communities 13.1 Area approach to identifying communities The analyses in the rest of the book assume that your samples are already "sorted" into communities. This chapter covers association analysis, which is the means by which you can identify which species tend to live together and which do not. In this way you can identify the species that make up various communities. This kind of analysis alters the focus from the samples to the species themselves. The main thrust of association analysis is the chi squared test and this is what you'll see in sections 13.1 and 13.2. In section 13.3 you'll see how to use other metrics of dissimilarity to do a similar job. The final section is about indicator species. Chi square is not the only approach to analysis of indicator species but it "fits" with the theme of the chapter. There are other ways to look at indicator species but I have not included them here. I hope to produce a monograph about TWINSPAN at some point in the future. The DufreneLegendre method IndVal, could have been included as it can be calculated using R; I will try to fit it into a later edition or add material to the website. 

Back to top 
14.1 Methods of ordination The topic of ordination (or multivariate analysis) is a broad one. Section 14.1 gives an overview of the main methods of ordination and some clues as to which might be used for which occasion. There are two main strands to ordination, indirect gradient analysis and direct gradient analysis. The former is used when you have species data, the latter is used when you wish to incorporate environmental data. Section 14.2 covers BrayCurtis ordination, which you'll see illustrated using Excel. This is useful for "beginners" as it helps to work out what the main thrust of ordination is about. The other methods are not trivial to conduct using Excel so you'll use R for them. Later in the section you'll see MDS, NMDS, PCO, PCA, CA and DCA used to explore species composition. You'll also see how to incorporate environmental data to the results of your ordination. In section 14.3 you'll see CCA and RDA used as the main tools for direct gradient analysis. There is a short section on modelbuilding, showing how you can build the "best" explanatory model for your data. The final section looks at a few ancillary aspects of ordination, such as adding environmental information, and identifying groupings on plots. You'll also see how to carry out Procrustean rotation to compare ordination results. Finally there is a short review of alternative methods to ordination – most of which are covered elsewhere in the book (particularly in chapter 10). 

Back to top 
15.1 Answers to Exercises Each chapter ends with a summary and several selfassessment questions. The answers to these questions are here in appendix 15.1. During the writing of this book I wrote a number of custom R commands that I thought were useful, they are mentioned in the text at appropriate points. The appendix 15.2 gives a complete list of the custom commands as well as notes on their use. More details for the commands (such as examples of use) can be found on the Custom Commands Page. 

Back to top 
My books on ecology and data analysisStatistics for Ecologists is available now from Pelagic Publishing. Get a 20% discount using the S4E20 code! 

See also... Learn
to use R for statistical analyses: Index
page Keywords:Here is a list of keywords – it is by no means complete! Diversity, biodiversity, species richness, rarefaction, species accumulation curves, presenceabsence, Arrhenius, Gleason, Gitay, Lomolino, Gompertz, MichaelisMenten, Weibull, Chao, Jackknife, ACE, beta diversity, alpha diversity, gamma diversity, rankabundance, Shannon entropy, Simpson's index, Rényi entropy, Tsallis entropy, Fisher's alpha, Fisher's logseries, Preston lognormal, bootstrapping, BergerParker, Hill numbers, Jevenness, Eevenness, evenness, effective species, Bonferroni correction, ttest, confidence interval, species turnover, Whittaker beta diversity, Whittaker turnover, absolute turnover, proportional turnover, additive diversity partitioning, hierarchical partitioning, group dispersion, multivariate homogeneity of group dispersion, permutation, MANOVA, species overlap, Mantel test, dominancediversity models, lognormal, preemption, preemption, Zipf, Mandelbrot, BrokenStick, Motomura model, Geometric series, similarity, clustering, Jaccard, Sørensen, BrayCurtis, Mountford, Raup, Horn, Manhattan, Euclidean, Canberra, Gower, Kulczynski, Minkowski, dendrogram, Morisita, standardisation, standardization, Wisconsin, hierarchical cluster analysis, partitioning around medoids, Ward, McQuitty, agglomerative nesting, divisive clustering, Kmeans analysis, HartiganWong, Lloyd, Forgy, MacQueen, Fuzzy cluster analysis, Chi Squared, chisquared, Pearson residuals, association analysis, indicator species, Ordination, multivariate analyses, direct gradient analysis, indirect gradient analysis, orthogonal, Polar Ordination, Principal coordinates analysis, principal coordinates analysis, PCO, PCoA, Classical Metric Multidimensional Scaling, MDS, nonmetric multidimensional scaling, NMDS, Principal components analysis, PCA, correspondence analysis, reciprocal averaging, CA, detrended correspondence analysis, DCA, decorana, canoco, redundancy analysis, RDA, eigenvalue, inertia, arch effect, canonical correspondence analysis, constrained correspondence analysis, conditioning variables, Procrustes 
