Focal Loss for Punctuation Prediction

Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Ye Bai, Cunhang Fan


Many approaches have been proposed to predict punctuation marks. Previous results demonstrate that these methods are effective. However, there still exists class imbalance problem during training. Most of the classes in the training set for punctuation prediction are non-punctuation marks. This will affect the performance of punctuation prediction tasks. Therefore, this paper uses a focal loss to alleviate this issue. The focal loss can down-weight easy examples and focus training on a sparse set of hard examples. Experiments are conducted on IWSLT2011 datasets. The results show that the punctuation predicting models trained with a focal loss obtain performance improvement over that trained with a cross entropy loss by up to 2.7% absolute overall F1-score on test set. The proposed model also outperforms previous state-of-the-art models.


 DOI: 10.21437/Interspeech.2020-1638

Cite as: Yi, J., Tao, J., Tian, Z., Bai, Y., Fan, C. (2020) Focal Loss for Punctuation Prediction. Proc. Interspeech 2020, 721-725, DOI: 10.21437/Interspeech.2020-1638.


@inproceedings{Yi2020,
  author={Jiangyan Yi and Jianhua Tao and Zhengkun Tian and Ye Bai and Cunhang Fan},
  title={{Focal Loss for Punctuation Prediction}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={721--725},
  doi={10.21437/Interspeech.2020-1638},
  url={http://dx.doi.org/10.21437/Interspeech.2020-1638}
}