Document Analysis Notes 1
Categries:
Notes
Document Analysis
Information Retrieval (IR)
- Document search
- Media search
- Question answering
- Recommendation systems
Natural Language Processing (NLP)
Extract information from text
Generate new text
Challenges
- Ambiguous
- Meaning
- Multilinguality
Typical IR and NLP Pipeline
Sentence Splitting -> Tokenisation -> Stemming -> Parsing -> Semantic Analysis