Calculating Optimal Tree-Based Classification
The script "tree_based_classification" takes :
- The script optimally finds non-overlapping GOOD subtrees for some categories.
- All sequences in a GOOD subtree for some category get assigned this category.
- In the optimal tree-based classification the number of sequences whose assigned
category coinsides with reference category,i.e. correctly classified, is maximal.
The main result is summarized in the number( or percent) of misclassified sequences.
The script produces also detailed information:
The list of GOOD categories and corresponding subtrees with the information on their composition.