Off-the-shelf (OTS) Datasets

Urdu NER news text

Dataset ID:
URD_NER001
Dataset Name:
Urdu NER news text
Common Use Cases:
NER, Content Classification, Search Engines
Language:
Urdu
Country:
Pakistan
Language Code:
urd
Country Code:
PAK
Product Type
Text
Detailed Product Type
News NER
Unit
20,634 sentences
Recording Device
N/A
Recording Condition
N/A
Contributors
N/A
Utterances
20,634
Unique Words
Available on request
Sample Rate (kHz):
N/A
Channels
N/A
Data Format
text
Source
Appen Global
Additional Info:
  • News text corpora with entities tagged in XML format: Person, Title, Organization, Location, Geo-political entity, Facility, Religion, Nationality, Quantity
Year of Collection
2009

Get Started with Off-the-Shelf AI Training Datasets

Appen’s extensive catalog of off-the-shelf (OTS) datasets spans multiple data types and industries, providing comprehensive coverage for various AI applications. These datasets are crafted to the highest standards of quality and accuracy, ensuring reliable training data for AI models.

Talk to an expert