Home > Research > Publications & Outputs > How well does random forest analysis model defo...

Electronic data

  • Paper_Env_Eco_Stats_17_Sep_2017_LC_14_Oct_2017

    Rights statement: The final publication is available at Springer via http://dx.doi.org/10.1007/s10651-017-0389-8

    Accepted author manuscript, 547 KB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

How well does random forest analysis model deforestation and forest fragmentation in the Brazilian Atlantic forest?

Research output: Contribution to journalJournal articlepeer-review

Published
<mark>Journal publication date</mark>12/2017
<mark>Journal</mark>Environmental and Ecological Statistics
Issue number4
Volume24
Number of pages21
Pages (from-to)529-549
Publication StatusPublished
Early online date22/11/17
<mark>Original language</mark>English

Abstract

We assessed the value of applying random forest analysis (RF) to relating metrics of deforestation (DF) and forest fragmentation (FF) to socio-economic (S-E) and bio-geophysical (BGP) factors, in the Brazilian Atlantic Forest of Minas Gerais, Brazil. A vegetation-monitoring project provided land cover maps, from which we derived DF and FF metrics. An ecologic-economical zoning project provided more than 300 S-E and BGP factors. We used random forest analysis (RF) to identify relationships between these sets of variables, and compared its performance in this task to that of a more traditional multiple linear regression approach. We found that RF modelled relatively-well variance in all metrics used (the rate of deforestation, the amount of forest, and the density and isolation of forest patches), presenting a better performance when compared to the classical approach. RF also identified geographical location and topographic factors as being most closely associated with patterns of DF and FF. Both analyses found factors associated with economic productivity, social institutions, accessibility and exploration to have little relationship with metrics. RF was better at explaining variations in rates of deforestation, remaining forest and patch patterns, than the multiple linear regression approach. We conclude that RF provides a promising methodology for elucidating the relationships between land use and cover changes with potential drivers.

Bibliographic note

The final publication is available at Springer via http://dx.doi.org/10.1007/s10651-017-0389-8