Decision Tree Classification

Machine Learning

You can enter a new title

The first column is gene names, other columns are sample expression values. Sample name format must be: SampleID_Group, e.g., GSM123_control, GSM124_treat

CP value balances model accuracy and complexity. Recommended range: 0.0001-0.01

💾 Data Retention: Upload files retained for 10days, Result files retained for 15daysData will be retained after package expiration. Please download important data in time.

Task UUID:
Created At:
Analysis Progress

📋 Analysis Log


                            

Error Information

📁 Main Task Files (for subsequent analysis) (Inherited from parent)

Result Files

🔄 Adjust Parameters & Re-analyze

Only the following parameters can be adjusted in sub-task

No sub-tasks yet

Login to View Results

Login to view your analysis results

Tool Introduction & Use Cases

Decision Tree Classification is primarily used for bioinformatics analysis.

Decision tree is a classic machine learning classification algorithm that makes classification decisions by building a tree structure. Sample names must contain group information in the format: SampleID_Group (e.g., GSM123_control, GSM124_treat). The tool automatically extracts group information from sample names, trains a decision tree model, and outputs gene importance ranking and decision tree visualization.

Main Use Cases:

  • Explore data patterns and support subsequent in-depth analysis

Key Features:

  • Standardized processing and result export
  • Key charts and table output
  • Can be used as input for subsequent analysis

Parameter Details

ParameterMeaning & RecommendationsSettings
Differential Gene Expression Matrix差异基因表达矩阵文件,样本名需包含分组信息(格式:SampleID_Group)
Required
Default: None
CP Value (Complexity Parameter)P-value Threshold: This is the statistical significance threshold. Only genes/proteins with P-values below this threshold are considered significantly changed, not just random fluctuations. Recommendation: Typically set to 0.05 or 0.01. Lower values are more stringent and reliable but may miss some potentially important molecules.
Optional
Default: 0.00028
Figure Size图片尺寸设置,格式:高*宽
Optional
Default: 8*12

Input File Requirements

Please ensure your input file format is correct, as this is fundamental to successful analysis. Typically, you need to provide a data matrix containing gene/protein expression values.

This tool does not currently provide sample files. Please refer to the common format: typically CSV or TXT files with the first column as gene/protein IDs, followed by expression values for each sample.

Frequently Asked Questions

Q: How long does the analysis take?

A: Estimated execution time is approximately 300 seconds, depending on your data size and current server load.

Q: What if the analysis fails?

A: If the analysis fails, your credits will not be deducted. First, check if your input file format matches the sample file, or adjust parameters based on error messages. If the problem persists, please contact us through the page's feedback function.

Login to View History

Login to view your analysis task history

Feedback Tips

We value your feedback! Please describe your issue or suggestion in detail, and we will reply as soon as possible.

Please describe your issue or suggestion in detail to help us better assist you

Please login before submitting feedback

My Feedback History

Login to view your feedback history