Secondary Analysis: Spatial Autocorrelation, Jaccard Coefficient & Moran's I

The direction of this project and the types of analysis that were used significantly changed from the original proposal after I further analyzed the data.
Originally I wanted to look at spatial autocorrelation between species that were both highly frequent in the Young Growth stand (PLOTA) and highly clustered in the Old Growth stand (PLOTC). By identifying these species of interest I thought that I could calculated spatial autocorrelation values amongst the species. I believed that if i was able to identify which species were highly correlated I could therefore conclude that the highly correlated species would grow well together. I hypothesized that by determining which species would grow well together, those species could be planted in clear-cut stands in the Philippines to try and relieve some of the stress on Old Growth forest stands from illegal loggers. While in theory this idea sounded good, once I started further analyzing the data and the method I had originally chosen to determine the spatial autocorrelation I realized that my original plan was flawed. Unfortunately it took a lot of prelimary work to realize that my plan was flawed.

Firstly I had to determine which species fit the criteria to be species of interest. I used a basic plotting program in R to determine if any clustering was occurring in the Old Growth stand (PLOTC) and where in the stand it was located. This analysis indicated that species clustering was indeed occurring and it was mainly located in the southwest corner of the stand.

The next step was to divide the Old Growth stand (PLOTC) into subplots and record which species were present in each subplot. This process was quite time consuming because it had to be done by hand. By identifying which species were present in each subplot I could test the dissimilarity between subplots, using the Jaccard coefficient, to establish a border around a cluster of species. If I could establish a border around a cluster I could easily determine which species fall inside the border and those species would become the species of interest for my analysis.
To calculate the Jaccard coefficient I wrote the following program in R:

 
function (a,b,c) {

dij=1-(a/(a+b+c))

print("Dissimilarity Coefficient: ")
print(dij)
}

To run the program I manually counted the number of species that were common and the number of species that were different amongst the subplots of interest and entered those values into the program. The outputs from this analysis showed that my hypothesis that clustering was occurring in the south-west corner of the Old Growth stand (PLOTC) was true and I was able to draw a border around the cluster. FIGURE 14 shows the Old Growth stand Plot including the cluster border. From this image and the work I had previously done, I was able to identify 12 species that fell within the border.

FIGURE 14

The second stage of this analysis was to determine the spatial autocorrelation between the 12 species of interest. To calculate the autocorrelation I used the Moran's I Autocorrelation Statistic, but I had to reformat the equation since it is generally used to measure the spatial autocorrelation between subplots not species. I reformatted the Moran's I statistic to be the following:


Moran's I Equation


where xi = number of sub-plots species i is located in, xj = number of sub-plots species j is located in, = average number of plots that have both species present, = the ij-value of the NxN spatial weight matrix and N = total number of sub-plots. The weight matrix, W, is matrix that show the relationship between sub-plots that are neighbours. Each entry in the matrix is the number of neighbouring sub-plots that contain the species. Outputs from the equation will lie between -1 (negatively correlated) and 1 (positively correlated).

Writing a program in R to calculate the Moran's I statistic was quite complicated because I had to account for neighbouring within a cell if the species I was analyzing were both present in the same cell. It was easier to calculate the Moran's I statistic by hand. This was extremely time consuming, because the neighbouring values had to be counted for each occurrence of a species in relationship to the second species being analyzed. Since there were 12 species of interest this process had to be repeated a large number of times for both the Young Growth stand (PLOTA) and the Mid Growth stand (PLOTB). From my analysis I calculated the spatial autocorrelation between the species Bagokon and Balante and the spatial autocorrelation between Bagokon and Biri to be 4.45112 and 3.221 which are incorrect. As a result of calculating these values I reevaluated my analysis and determined that my logic and my reformation of the Moran I statistic was flawed.

I wanted to conduct an analysis that still focused around determining a species’ successional classification because I felt that by determining the classification it could lead to the primary goal of relieving stress on Old Growth forests in the Philippines by illegal loggers. While my initial ideas on how to conduct this analysis did not work some of the work that I put into the analysis help in the final analysis of this project. The hand work I did to determine the size of subplots within the Old Growth stand (PLOTC) and record the presence of species within the subplots helped to determine the optimal size of subplots for the Morisita's Index cluster analysis. Although the initial plan for this project did not work it evolved it to an analysis that better fit the data and provided more ecologically relevant information.


HOME
INTRODUCTION
DATA DETAILS
MULTIVARIATE METHODS
MULTIVARIATE RESULTS & DISCUSSION
CONCLUSION
APPENDICES
REFERENCES & ACKNOWLEDGEMENTS

DATA PREPARATION METHODS
DATA PREPARATION RESULTS & DISCUSSION

PRELIMINARY ANALYSIS