I'm a permanent research scientist (Chargé de recherche) at the Inria branch at the University of Montpellier, as a member of the Iroko and Pl@ntNet teams. My research lies in ecological statistics and confronts statistical models and machine learning methods with massive heterogeneous biodiversity data, often opportunistic citizen science records, in order to measure and anticipate the effects of human pressure on ecosystems, in particular plant invasions, and inform conservation planning.
Specifically, I developp robust methods for spatio-temporal Species Distribution Models from heterogeneous biodiversity data. To make the most out of the available biodiversity data, notably opportunistic records from citizen science plateforms (e.g. Pl@ntNet, iNaturalist), I use statistical observation models, such as point processes, accounting for the peculiarities of diverse types of species observations (e.g. presence-only records, presence-absence vegetation plots) while correcting for sampling biases in space, time, across species or observers. Through deep learning based SDMs (deepSDMs), I try to make SDMs able to deal with heterogeneous input variables, such as satellite images, satellite time series or environmental rasters, which can enable to capture biodiversity patterns at a high resolution. Bringing loss functions from statistical ecology, e.g. based on point processes, into the deep learning world is a stimulating challenge that enables to scale deepSDMs to large spatio-temporal coverages (Europe), many species (e.g. thousands) and a high spatial resolution (e.g. 50 meters) while keeping the computational burden and carbon footprint low. With the Pl@ntNet team and in the horizon Europe projects MAMBO, GUARDEN, B3 and now BEAGLE, we turn these methodological advances into useful operational tools for biodiversity conservation in Europe. For instance, we have delivered high resolution maps of habitats and plant species distribution at European scale as a user-friendly web application for : GeoPl@ntNet. Current challenges are to downscale temporal trends of species occupancy and Essential Biodiversity Variables, notably thanks to citizen science presence-only records, in order to map biodiversity changes, and in particular species invasions, growth or declines in space, and further connect them to local drivers (climate change, agriculture, urbanisation). To further unveil ecological processes underlying species distribution changes, focusing on biological invasions, I try to delimit the estimability of (process-based) invasion dynamic SDMs from heterogeneous data, and to deliver a robust estimation routines for such models as an R package.
Regarding my earlier academic trajectory, I did my PhD INRAE at the UMR AMAP, Montpellier, France, where I studied statistical methods for species distribution models (SDM) based on large presence-only datasets coming from citizen sciences programs. The exploitation of data from the project Pl@ntNet was a major motivation for my thesis, and I collaborated closely with this project. My work included to (i) evaluate the benefits of deep learning and convolutional neural networks approaches for presence-only SDM, leading to pioneer deep learning approach to SDMs (deepSDMs), (ii) caracterize biases arising due to the distribution sampling effort, species niches and background points in presence only SDM based on Poisson point processes (iii) develop an new unbiased approach based on a joint model of sampling effort and species distributions, and (iv) measure the sampling and taxonomic coverage of Pl@ntNet contributions, in order to compare it with national botanical conservatories. This PhD was founded by a the national INRA-INRIA scholarship of 2016.
I did a first 17 months CNRS PostDoc in the Laboratoire d'Ecologie Alpine, Grenoble, working for the project EcoNet. We explored the use of graph embedding methods to compare ecological interactions network architectures across space, environment or time. I especially leveraged the knowledge of trophic interaction between European terrestrial vertebrates and broad scale crowdsourcing data (GBIF) to study the changes of trophic network architectures related to higher land use intensity.
I worked with David Richardson and Cang Hui for 18 months at the Center for Invasion Biology and Biomath Hub of Stellenbosch University, South Africa, where we explored how iNaturalist and other crowdsourcing data could complete more standardized data for studying plant invasions. The use cases spanned various scales, from the early detection of the naturalisation of cultivated species around South African urban areas to the global monitoring of introductions. I also started developping a Dynamic Species Distribution Modeling (DSDM) approach combining past datasets (e.g. Southern African Plant Invaders Atlas) with recent crowdsourcing records (e.g. iNaturalist) to reconstruct plant invasion spatial dynamics over decades and better understand their drivers.