INCIDB ships as a 4-table normalized relational schema exported into pipe-delimited UTF-8 CSV (`|`) and high-performance Apache Parquet (`pyarrow`) archives.
Contains canonical brand identities and ethical certifications across 5,994 global cosmetic lines.
| Field Name | Data Type | Description | Sample Value |
|---|---|---|---|
brand_id | INTEGER | Primary key unique identifier | 1 |
name | STRING | Standardized commercial brand name | Laneige |
country_of_origin | STRING | Country of formulation or headquarters | South Korea |
is_cruelty_free | INTEGER | Cruelty-free certification (1 = Yes, 0 = No) | 0 |
is_vegan | INTEGER | 100% vegan formulation line (1 = Yes, 0 = No) | 0 |
Contains 57,181 canonical chemical compounds enriched with toxicology, allergen, and regulatory data from 8 international bodies.
| Field Name | Data Type | Description | Sample Value |
|---|---|---|---|
ingredient_id | INTEGER | Primary key unique identifier | 1 |
inci_name | STRING | International Nomenclature Cosmetic Ingredient name | DIISOSTEARYL MALATE |
cas_number | STRING | Chemical Abstracts Service registry number | 50-81-7 |
common_name | STRING | Plain English common compound name | Diisostearyl Malate |
primary_function | STRING | Functional category (CosIng taxonomy) | Skin-Conditioning Agent |
comedogenic_rating | FLOAT | Pore-clogging likelihood scale (0.0 - 5.0) | 0.0 |
ewg_hazard_score | FLOAT | EWG Skin Deep hazard score (1.0 low - 10.0 high) | 1.0 |
is_common_allergen | INTEGER | FDA MoCRA contact dermatitis allergen (1/0) | 0 |
is_fungal_acne_trigger | INTEGER | Malassezia folliculitis feeding trigger (1/0) | 0 |
description | STRING | Scientific or regulatory monograph summary | Standard cosmetic emollient... |
cir_safety_verdict | STRING | Cosmetic Ingredient Review scientific verdict | Safe as used |
fda_warning | STRING | US FDA OTC warning monograph | No warning required |
cancer_hazard_flag | INTEGER | Carcinogenic hazard alert (1 = Flagged, 0 = Clean) | 0 |
endocrine_hazard_flag | INTEGER | Endocrine disruptor alert (1 = Flagged, 0 = Clean) | 0 |
Contains 19,847 commercial cosmetic formulations harvested from prestige retailers, K-Beauty standards, and clinical registries.
| Field Name | Data Type | Description | Sample Value |
|---|---|---|---|
product_id | INTEGER | Primary key unique identifier | 67455 |
brand_id | INTEGER | Foreign key referencing brands.brand_id | 46067 |
barcode_ean | STRING | Universal GTIN / EAN-13 barcode | P309308 |
name | STRING | Full commercial product title | Good Genes Lactic Acid Treatment |
category | STRING | Primary skincare taxonomy category | Skincare |
retail_price_usd | FLOAT | Retail price in USD | 85.0 |
raw_ingredient_text | STRING | Full unbroken ingredient listing from package label | Botanical Blend [Water/Eau/Aqua... |
created_at | STRING | Ingestion timestamp (ISO 8601) | 2026-07-02 14:01:48 |
Relational junction table containing 330,088 mappings preserving exact label order and concentration index.
| Field Name | Data Type | Description | Sample Value |
|---|---|---|---|
product_id | INTEGER | Foreign key referencing products.product_id | 67455 |
ingredient_id | INTEGER | Foreign key referencing ingredients.ingredient_id | 12404 |
position_index | INTEGER | Exact label position index (1 = highest concentration) | 1 |
concentration_percentage | FLOAT | Explicit clinical concentration percentage (when disclosed) | 10.0 |