Visualization of the workflow demonstrating a step-by-step explanation for a sc-SynO analysis. a) Several or one snRNA-Seq or scRNA-Seq fastq datasets can be used as an input. Here, we identify our cell population of interest and provide raw or normalized read counts of this specific population to sc-SynO for training. b) Further information for cluster annotation and processed count data are serving as input for the core algorithm. c) Based on the data input, we utilize the LoRAS synthetic oversampling algorithms to generate new cells around the former origin of cells to increase the size of the minority sample. The trained Machine Learning classifier is validated on the trained, pre-annotated dataset to evaluate the performance metrics of the actual model. The model is now ready to identify the learned rare-cell type in novel data.
Creators
Submitter
Views: 1462 Downloads: 61
Created: 16th Oct 2020 at 11:24
Last updated: 21st Jan 2021 at 19:39
None
Version History
Version 2 (latest) Created 20th Jan 2021 at 20:04 by Markus Wolfien
No revision comments