Overview
Last updated
Was this helpful?
Last updated
Was this helpful?
Codatta is a decentralized network connecting AI developers with Data Creators to Co-Train AGI and it is powered by XnY. Since our launch in April 2024, we have attracted over 200,000+ users, with growing demand across our key sectors: Digital Assets, Healthcare, Robotics, and AI Ecommerce —highlighting the need for high-quality frontier data (X-data + Y-data).
We aim to address significant challenges posed by centralized data systems:
Data Silos: Data is often locked within centralized entities, limiting accessibility and utility.
Lacking Data Integrity: In-house and centralized models lack transparency in data curation and quality verification, and suffer from a single point of failure.
Inefficiency: The traditional model of curating data is costly and involves a lot of duplicative efforts among projects relying on the same dataset.
In contrast, Codatta aggregates and curates data from various sources, ensuring high quality and confidence through collaborative and automated mechanisms.
We build systems to scale up the curation and confidence of the following data:
Account Annotation: This data pack contains categories (such as 'cex', 'scam') and entities (such as 'Uniswap') associated with an account. These data are critical for building AML software, on-chain risk management, and trend analysis. For privacy protection, Codatta will not accept entity information revealing individual ownership of accounts or addresses. This ensures privacy while enabling confidence and quality measurement computation. If you are interested in the methodology of improving data confidence, refer to the following sections.
User Demographic Annotation: This includes gender, age group, primary residence region, and other personal demographic details. These data enable personalized experiences in decentralized social and e-commerce applications. Additionally, this information can be monetized via decentralized ad delivery.