As for the "context", I think you mean that you do not have whole sentences, i.e. there are no previous / next tokens, in which case you are faced with a non-standard NER. I do not know about the available program or training data for this particular problem, if you did not find them, you will have to create your own corps for training and / or evaluation purposes.
Your name database is likely to help significantly, depending on how much of the account name is actually present in the database. You probably also have to rely on character-level name morphology as patterns (see Pattern Examples in [1]). When you have a training kit with functions (availability in the database, morphology, other account information) and solutions (actual names of annotated bills), using standard machine learning as an SVM will be quite simple (if you are not familiar with this, simply ask).
Some other suggestions:
- You can probably also use other account information: company name, position, tax reference, etc.
- You can also act selectively - if all accounts must indicate (exactly?) The name of one person, you can exclude all other texts (for example, amounts, tax names, positions, etc.) or accept in a separate model that among the whole text in the account, only one should be guessed as a name.
[1] Ranking algorithms for retrieving the desired object: magnification and voted perceptron (Michael Collins, 2002)
source share