Comparison of Logistic regression, Convolution Neural Network, and Kernel Approaches for Classifying the Caenorhabditis Elegans Motion
Abstract
Time series data are widely used in many fields including microbiology data. It is necessary to know how to classify the category to which observation belongs by using statistical classification methods and machine learning and deep learning algorithms. The study of the movement of some types of nematodes as one of the types of microorganisms including Caenorhabditis elegans (CE) is important to determine the actions and their impact on the life of the worms. In this study the CE motion time series data were represented by its wave motion angles which would be the study case. the non-linearity and uncertainty will be among the most common problems in this type of data that may lead to classifications that are not accurate. Convolutional Neural Network (CNN) will be used as one of the deep learning techniques and it is a non-linear method used to classify CE movement as a dependent variable in binary cases based on images of wave motion angles as an independent variable and its use will lead to accurate results because it is a suitable non-linear method to deal with Study data to solve nonlinearity and uncertainty problems through digital data visualization. Logistic regression (LR) and kernel method were also used to classify CE angles of movement. The AR(p) rank was used to determine the structure of the used methods. And by comparing the results between the methods used, it was found that the CNN method is superior to the other methods used. Therefore, it is possible to conclude that the use of the CNN method, which is based on pictorial classification, leads to accurate classification results compared to other methods based on numerical classification.
References
- .1 Javer, A., et al. Identification of C. elegans strains using a fully convolutional neural network on behavioural dynamics. in Proceedings of the European Conference on Computer Vision (ECCV) Workshops. 2018.
- .10 Neapolitan, R.E. and X. Jiang, Artificial intelligence: With an introduction to machine learning. 2018: CRC Press.
- .11 Zhao, B., et al., Convolutional neural networks for time series classification. Journal of Systems Engineering and Electronics. 2017; 28 (1): 162-169.
- .12 Worster, A., J. Fan, and A. Ismaila, Understanding linear and logistic regression analyses. Canadian Journal of Emergency Medicine. 2007; 9 (2): 111-113.
- .13 Smola, A.J., S. Vishwanathan, and T. Hofmann. Kernel methods for missing variables. in International Workshop on Artificial Intelligence and Statistics. 2005. PMLR.
- .14 Luque, A., et al., The impact of class imbalance in classification performance metrics based on the binary confusion matrix. Pattern Recognition. 2019; 91: 216-231.
- .15 Yemini, E., et al., A database of c. elegans behavioral phenotypes. Nature Methods. 2014; 10 (9): 877879.
- .16 Bilbao, A., et al., Roll maneuvers are essential for active reorientation of Caenorhabditis elegans in 3D media. Proceedings of the National Academy of Sciences. 2018; 115 (16): E3616-E3625.
- .2 Uhlemann, J., O. Cawley, and T. Kakouli-Duarte. Nematode Identification using Artificial Neural Networks. in DeLTA. 2020.
- .3 Garca Garv, A., et al., Towards lifespan automation for Caenorhabditis elegans based on deep learning: analysing convolutional and recurrent neural networks for dead or live classification. Sensors. 2021; 21 (14): 4943.
- .4 Newell Stamper, B.L., et al., Movement decline across lifespan of Caenorhabditis elegans mutants in the insulin/insulinlike signaling pathway. Aging cell. 2018; 17 (1): e12704.
- .5 Maertens, T., et al., Multilayer network analysis of C. elegans: Looking into the locomotory circuitry. Neurocomputing. 2021; 427: 238-261.
- .6 Le, Q., T. Sarls, and A. Smola. Fastfood-approximating kernel expansions in loglinear time. in Proceedings of the international conference on machine learning. 2013.
- .7 Brockwell, P.J. and R.A. Davis, Time series: theory and methods. 2009: Springer science & business media.
- .8 Liu, L.-M., Time Series Analysis and Forecasting. 2nd ed. 2006, Illinois, USA: Scientific Computing Associates Corp.
- .9 Theobald, O., Machine learning for absolute beginners: a plain English introduction. Vol. 157. 2017: Scatterplot press.