Questions tagged [ner]

Named Entity Recognition (NER) involves pulling out specific entities like persons, organization, or places from natural language text.

544 questions
3
votes
1 answer

I want to extract text values from text in spacy

I am new in using spacy. I want to extract text values from sentences training_sentence="I want to add a text field having name as new data" OR training_sentence=" add a field and label it as advance data" So from the above sentence, I…
Amit Kanderi
  • 96
  • 1
  • 6
3
votes
1 answer

What is the list of possible tags with a description of CoNLL 2003 NER Task?

I need to do some NER. I've found DeepPavlov library that does this. Here is an example from docs: from deeppavlov import configs, build_model ner_model = build_model(configs.ner.ner_ontonotes, download=True) ner_model(['Bob Ross lived in…
rominf
  • 2,123
  • 2
  • 18
  • 31
3
votes
1 answer

Named Entity Recognition influence of previous sentence

In Named Entity Recognition (NER), does the previous sentence have any influence on the current sentence? Is the result the same if you apply NER on every sentence separately compared to applying NER on articles consisting of multiple sentences?…
Guido
  • 4,234
  • 1
  • 19
  • 39
3
votes
1 answer

How to find similar noun phrases in NLP?

Is there a way to identify similar noun phrases. Some suggest use pattern-based approaches, for example X as Y expressions: Usain Bolt as Sprint King Liverpool as Reds
Shimak
  • 77
  • 1
  • 5
3
votes
1 answer

Adding entities to Stanford NLP NER Classifier

I have a very simple method to extract Names, Organisations and Locations from a string. I am using the .NET Nuget Libraries for Stanford NLP. It looks like this. CRFClassifier Classifier =…
Coesy
  • 896
  • 1
  • 9
  • 29
3
votes
2 answers

spaCy nlp - positions of entities in string, extracting nearby words

Lets say I have a string and want to mark some entities such as Organizations. string = I was working as a marketing executive for Bank of India, a 4 months.. string_tagged = I was working as a marketing executive for [Bank of India], a 4…
Sherly
  • 67
  • 5
3
votes
1 answer

Converting from XML annotations to BRAT format

I have an annotated data set in XML format: see example below Treatment of Erosive Esophagitis in patients where the tagged words are in XML tags as shown. I need to get it into BRAT format, such as: T1 …
876868587
  • 2,802
  • 2
  • 16
  • 43
3
votes
4 answers

Train Spacy NER on Indian Names

I am trying to customize Spacy's NER to identify Indian names. Following this guide https://spacy.io/usage/training and this is the dataset I am using…
shri_wahal
  • 328
  • 1
  • 4
  • 15
3
votes
1 answer

nltk tag tag_sents give different results

I essentially want to use the nltk StanfordNERTagger in order to purify a list of names (eg. there are organizations in there I want to remove) and I stumbled on weird issue. It seems the tag results of one sentence depend on what other sentences…
m.rel
  • 51
  • 3
3
votes
0 answers

Adding domain knowledge (custom features) to NER

I'm on an Ubuntu machine with Python 3.5.2 and spaCy 2.0. I'm training a blank Spanish model to recognize entities in resumes. For that I used custom word embeddings and I'm doing a large entity annotation project. I was able to segment a resume and…
3
votes
2 answers

In spaCy, why is '\n' constantly tagged as GPE by english NER?

I am starting to get acquainted with spaCy v2.0. When I run Lightning_Tour.py with my own documents, I am seeing that the end of line string \n is being consistently tagged as GPE in the entity output. So is there any way to preprocess the document…
demongolem
  • 8,796
  • 36
  • 82
  • 101
2
votes
1 answer

NLP: Create spaCy Doc objects based on delimiters or combine multiple Doc objects to form a single object

I am trying to create a spaCy Doc object (spacy.tokens.doc.Doc) using the make_doc() function. This is what I have done: import spacy nlp = spacy.load('en') a = nlp.make_doc("Sam, Software Engineer") print(list(a)) # [Sam, ,, Software,…
Animeartist
  • 653
  • 1
  • 6
  • 16
2
votes
0 answers

TensorFlow: Using CRF for NER (shape-mismatch) [tensorflow_addons]

I am trying to build a Bi-LSTM CRF model for NER on CoNLL-2003 dataset I have encoded the words using char embedding and GloVe embedding, for each token I have an embedding of size 341 This is my model: def get_model(embed_size, max_seq_len,…
Kuldeep Singh Sidhu
  • 3,574
  • 2
  • 6
  • 19
2
votes
1 answer

Name Entity Recognition (NER) for multiple languages

I am writing some code to perform Named Entity Recognition (NER), which is coming along quite nicely for English texts. However, I would like to be able to apply NER to any language. To do this, I would like to 1) identify the language of a text,…
CrossLord
  • 330
  • 11
2
votes
1 answer

Faster NER extraction using SpaCy and Pandas

I have a df with a column that contains comments from which I want to extract the organisations. This article provides a great approach but it is too slow for my problem. The df I am using has over 1,000,000 rows and I am using a Google Colab…
heinz
  • 25
  • 6
1 2
3
36 37