Historical datasets from vast and relatively inaccessible areas are sources of potentially uniqueinformation still valuable for biodiversity studies today. In many research elds, ranging from climate changeto projection of species loss, great efforts have been made to integrate historical datasets with recent datato create databases that are as complete as possible. Unlocking the information contained in presence-onlydata, largely prevalent in such databases, presents a challenge for statistical modeling because of insidiousobservational errors due to the opportunistic nature of the data gathering process. In this article we proposean appropriate statistical method for the joint analysis of historical and newly collected presence-only data,i.e. a Bayesian semi-parametric generalized linear mixed model (GLMM) with Dirichlet process randomeffects. The potential of the method is illustrated by considering the Ross Sea section of the SOMBASE, aninternational compilation of Southern Ocean Mollusc distributional records, from 1899 to 2004 and beyond.Despite the presence of sampling bias and non-detection errors, the proposed model draws latent informationfrom the data such that the resulting estimates of the parameters of interest are not only coherent with thoseobtained in indirectly related studies based on well structured data, but also suggest interesting ideas forfurther research.

A Bayesian semi-parametric GLMM for historical and newly collected presence-only data: an application to species richness of Ross Sea Mollusca

Nava, C. R.;
2017-01-01

Abstract

Historical datasets from vast and relatively inaccessible areas are sources of potentially uniqueinformation still valuable for biodiversity studies today. In many research elds, ranging from climate changeto projection of species loss, great efforts have been made to integrate historical datasets with recent datato create databases that are as complete as possible. Unlocking the information contained in presence-onlydata, largely prevalent in such databases, presents a challenge for statistical modeling because of insidiousobservational errors due to the opportunistic nature of the data gathering process. In this article we proposean appropriate statistical method for the joint analysis of historical and newly collected presence-only data,i.e. a Bayesian semi-parametric generalized linear mixed model (GLMM) with Dirichlet process randomeffects. The potential of the method is illustrated by considering the Ross Sea section of the SOMBASE, aninternational compilation of Southern Ocean Mollusc distributional records, from 1899 to 2004 and beyond.Despite the presence of sampling bias and non-detection errors, the proposed model draws latent informationfrom the data such that the resulting estimates of the parameters of interest are not only coherent with thoseobtained in indirectly related studies based on well structured data, but also suggest interesting ideas forfurther research.
2017
Bayesian hierarchical GLMM
Dirichlet process random effects
Opportunistic sampling schemes
Presence-only data
Ross Sea Mollusca
Species richness
File in questo prodotto:
File Dimensione Formato  
ENV-revision-23luglio.pdf

non disponibili

Licenza: Copyright dell'editore
Dimensione 434.31 kB
Formato Adobe PDF
434.31 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Carota_A_Bayesian_semiparametric_GLMM.pdf

non disponibili

Licenza: Copyright dell'editore
Dimensione 389.13 kB
Formato Adobe PDF
389.13 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14087/14943
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact