Collins, M. (2002) ‘Discriminative training methods for hidden Markov models’, in Proceedings of the ACL-02 conference on Empirical methods in natural language processing  - EMNLP ’02. Association for Computational Linguistics, pp. 1–8. Available at: https://doi.org/10.3115/1118693.1118694.
Jurafsky, D. and Martin, J.H. (2009) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. 2nd ed. Upper Saddle River, N.J.: Pearson Prentice Hall.
Smith, N.A. (2011) Linguistic structure prediction. San Rafael, Calif: Morgan & Claypool. Available at: http://dx.doi.org/10.2200/S00361ED1V01Y201105HLT013.
Stat NLP Book (no date). Available at: https://github.com/uclmr/stat-nlp-book.