Abstract
We present a system employing domain specific dictionaries and features to recognize chemical entities. The system utilizes sentence segmentation, tokenization, feature generation, Conditional Random Field (CRF) training and one post-processing step. The dictio-naries were compiled from PubChem, Wikipedia, ChEMBL, DrugBank,word2vec clusters from US patents belonging to A61K class. We report the evaluation results of the run where development set was not included as part of the training set. The best performing model for CEMP taskh as the micro average precision, recall and F-score values of87.26%,79.98%and83.46%, respectively.