71988

Автор(ы): 

Автор(ов): 

4

Параметры публикации

Тип публикации: 

Статья в журнале/сборнике

Название: 

HighMLR: An open-source package for R with machine learning for feature selection in high dimensional cancer clinical genome time to event data

ISBN/ISSN: 

0957-4174

DOI: 

10.1016/j.eswa.2022.118432

Наименование источника: 

  • Expert Systems with Applications

Обозначение и номер тома: 

Vol. 210

Город: 

  • Амстердам

Издательство: 

  • Elsevier

Год издания: 

2022

Страницы: 

https://www.sciencedirect.com/science/article/abs/pii/S0957417422015329
Аннотация
Machine learning techniques, popularly used as a tool for dimensionality reduction and pattern recognition of features, have been utilized extensively in data mining. In survival analysis, where the primary outcome is the time until a specific event occurs, identifying relevant features for building an efficient prediction model is essential. This is where machine learning can be a suitable option. However, there is an existing gap in utilizing machine learning techniques in high-dimensional survival data due to the non-availability of convenient programming functions and packages. In this article, we have developed an efficient machine learning procedure for analyzing survival data associated with high-dimensional gene expressions. Though there are several R libraries available for performing machine learning, no package support is available to implement machine learning with classification on high-dimensional survival data. highMLR, our developed R package, is capable of implementing machine learning methods on high dimensional survival data and provides a way of feature selection based on the logarithmic loss function. Several statistical methods for survival analysis have been incorporated into this machine learning algorithm. A high-dimensional gene expression dataset has been analyzed using the proposed R library to show its efficacy in feature selection.

Библиографическая ссылка: 

Bhattacharjee A.D., Vishwakarma G.K., Banerjee S., Пащенко А.Ф. HighMLR: An open-source package for R with machine learning for feature selection in high dimensional cancer clinical genome time to event data // Expert Systems with Applications. 2022. Vol. 210. С. https://www.sciencedirect.com/science/article/abs/pii/S0957417422015329.