Monthly Archives: September 2017

Using NLTK on Heroku with Python

On the principle of the “Extract PDF text with Python” ticket, I will create a service that uses the NLTK package. NLTK is a set of tools for building language processing programs in Python. It therefore requires using Python. Basic … Continue reading

Posted in NLP, Tutorial | Leave a comment

Services for bibliographic analysis

I present here the needs related to our approach of analysis of the production and publication of scientific documents – essentially articles – by Telecom ParisTech. It is the goal of the SemBib project. The articlesTélécom ParisTech has a bibliographical … Continue reading

Posted in NLP, SemBib | Leave a comment

Extract PDF text with Python

As part of our SemBib project to analyze the scientific production of Telecom ParisTech, I recover a lot of PDF files. To analyze the content, I need to get the raw text. In addition, as indicated in the blog Services … Continue reading

Posted in NLP, SemBib, Tutorial | Leave a comment