کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
5741885 1617192 2017 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Consensus methods based on machine learning techniques for marine phytoplankton presence-absence prediction
ترجمه فارسی عنوان
روش های انطباق بر اساس تکنیک های یادگیری ماشین برای پیش بینی عدم وجود حضور فیتوپلانکتون دریایی
کلمات کلیدی
فیتوپلانکتون دریایی، اطلاعات موجود بودن، فراگیری ماشین، روش های توافق غیر همگن، پیش بینی،
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک بوم شناسی، تکامل، رفتار و سامانه شناسی
چکیده انگلیسی


- We present six non-homogeneous consensus models to predict the presence-absence of marine phytoplankton species.
- In most of the cases, the consensus models behaved better than the single-models that were used to construct them.
- The single-models considered were generalized linear models, random forests, boosting and support vector machines.
- Our results suggest that attention must be given to consensus methods when dealing with ecological prediction.

We performed different consensus methods by combining binary classifiers, mostly machine learning classifiers, with the aim to test their capability as predictive tools for the presence-absence of marine phytoplankton species. The consensus methods were constructed by considering a combination of four methods (i.e., generalized linear models, random forests, boosting and support vector machines). Six different consensus methods were analyzed by taking into account six different ways of combining single-model predictions. Some of these methods are presented here for the first time. To evaluate the performance of the models, we considered eight phytoplankton species presence-absence data sets and data related to environmental variables. Some of the analyzed species are toxic, whereas others provoke water discoloration, which can cause alarm in the population. Besides the phytoplankton data sets, we tested the models on 10 well-known open access data sets. We evaluated the models' performances over a test sample. For most (72%) of the data sets, a consensus method was the method with the lowest classification error. In particular, a consensus method that weighted single-model predictions in accordance with single-model performances (weighted average prediction error - WA-PE model) was the one that presented the lowest classification error most of the time. For the phytoplankton species, the errors of the WA-PE model were between 10% for the species Akashiwo sanguinea and 38% for Dinophysis acuminata. This study provides novel approaches to improve the prediction accuracy in species distribution studies and, in particular, in those concerning marine phytoplankton species.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Ecological Informatics - Volume 42, November 2017, Pages 46-54
نویسندگان
, , ,