Skip to content

A JSON API to tag a sentence with part of speech tags. Uses UDPipe, so support for hundreds of languages.

License

Notifications You must be signed in to change notification settings

EmilStenstrom/json-tagger

Repository files navigation

JSON Tagger

UDPipe finds detailed Part-of-speech tags (Noun, Verb, ...) in Swedish sentences. This code makes UDPipe available via a JSON API.

Play with it at: https://json-tagger.sammanfatta.se

How to run JSON Tagger locally

JSON-Tagger is built for Python 3.6. I haven't tested it on other versions, so it might work or other 3.x versions, but not on Python 2.

  1. Clone this project from GitHub:
git clone https://github.com/EmilStenstrom/json-tagger.git json-tagger
  1. Install dependencies:
cd json-tagger
pip install -r requirements.txt
  1. Get a UDPipe model file

Download the latest version of the udipe models from http://ufal.mff.cuni.cz/udpipe#download. Pick the language you are interested in, create a data directory in the root of the project, and put the .udipe file there. Now update the path to the file in ud_helper, and in actions.py if you use a language other than Swedish. Done!

  1. Start the local web server
python run.py --run
  1. Surf to http://localhost:8000 in your browser!

Run the tests

The trickiest part of delivering an API like JSON Tagger is to handle encodings. I've found that the easiest way to make sure I don't mess them up is to run code that accesses the API from different languages. To run some simple integration tests against a version running locally:

  1. Install dependencies

The scripts assume you are running them inside a virtualenv with python pointing to Python 3, and that python2 and curl is available on the PATH.

pip2 install requests
pip install requests
gem install http
npm install -g request
  1. Run all the tests
tests/run_all

If any of the tests fail it will output the difference in output between the result and the expected result.

About

A JSON API to tag a sentence with part of speech tags. Uses UDPipe, so support for hundreds of languages.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy