This experiment uses one dimensional 2 features to predict the dialect. The features are:
The features extraction function is as follows:
For more details, please refer to the feature_extract function. Donot be overwhelmed by the feature extraction function. The function is just a wrapper of all possible features that can be combined and explored.
Separate models perfom better than multi-task learning for the same architecture and parameters. For the sake of fair comparsion, everything is same except that separate models train the MLP model separately for three tasks, while multi-task learning trained and predict the three tasks at the same time.
- Rong Wang, 04.08,2023