Compar:IA conversations

License: Etalab 2.0

Steward: ComparIA

Task: NLG

Release Date: 1/19/2026

Format: PARQUET

Size: 1.81 GB

Description

The compar:IA dataset is a large-scale collection of real user conversations generated on the compar:IA platform, a public conversational AI comparison service developed within the French Ministry of Culture. The platform allows users to interact with two conversational AI models side by side and compare their answers in a blind setting. Its goals are both educational, by helping users understand how different models behave, and technical, by contributing open alignment and evaluation data, with a strong focus on French-language use.

The dataset contains nearly 400,000 paired conversations produced by more than 30 conversational AI models, including both open-source and proprietary systems. Most interactions are in French and reflect unconstrained, real-world uses of conversational AI across a wide range of topics, such as writing, programming, administration, creative tasks, and everyday questions. Prompts are not curated or engineered, which makes the data representative of actual user behavior rather than benchmark-style evaluations.

Each entry includes the full conversation with two compared models, along with metadata linking the pair, identifying the models, and describing the dialogue structure. Additional fields provide automatically generated summaries, keywords, thematic categories, detected languages, and usage metadata. The dataset also includes estimates related to model size, output token counts, and electricity consumption, enabling analysis of performance and efficiency trade-offs.

User consent is collected through the platform’s terms of use. Automated detection is used to identify and anonymize personally identifiable information, but no filtering is applied to remove potentially toxic or sensitive content, in order to support research on safety and real-world risks.

The dataset is released under the open Etalab 2.0 license and is intended for research and development in conversational model alignment, evaluation methods, human–AI interaction, and AI safety, particularly for French and other under-resourced languages.
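
As an illustration of the efficiency analysis mentioned above, the following minimal sketch aggregates estimated electricity use per 1,000 output tokens for each model. The field names (`model`, `output_tokens`, `energy_wh`), the unit, and the sample rows are hypothetical placeholders, not the dataset's actual schema or values; check the published column list before adapting it.

```python
from collections import defaultdict

# Hypothetical rows: field names, units, and values are illustrative
# assumptions, not taken from the real dataset.
rows = [
    {"model": "model-x", "output_tokens": 200, "energy_wh": 0.50},
    {"model": "model-x", "output_tokens": 300, "energy_wh": 0.75},
    {"model": "model-y", "output_tokens": 100, "energy_wh": 1.00},
]


def energy_per_1k_tokens(rows):
    """Return estimated energy (Wh) per 1,000 output tokens for each model."""
    tokens = defaultdict(int)
    energy = defaultdict(float)
    for row in rows:
        tokens[row["model"]] += row["output_tokens"]
        energy[row["model"]] += row["energy_wh"]
    # Skip models with no output tokens to avoid dividing by zero.
    return {m: 1000.0 * energy[m] / tokens[m] for m in tokens if tokens[m] > 0}


result = energy_per_1k_tokens(rows)
for model, value in sorted(result.items()):
    print(f"{model}: {value:.2f} Wh per 1k output tokens")
```

Summing tokens and energy before dividing weights each model by its total output, so a few very short responses do not dominate the per-model figure.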

Considerations

Restrictions/Special Constraints

Please be aware that some model outputs are not reusable.

Forbidden Usage

-

Processes

Intended Use

This dataset is intended for evaluating and comparing large language models through human preference judgments, with a focus on non-English and under-resourced languages. It supports research on model quality, alignment, and multilingual performance, as well as public-interest analysis and open benchmarking.
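
One simple way to turn pairwise preference judgments into a model comparison is a per-model win rate. The sketch below assumes each record names the two compared models and which side the user preferred; the field names (`model_a`, `model_b`, `preferred`) and the sample votes are hypothetical, and this card does not confirm that preference votes are stored in this form.

```python
from collections import Counter

# Hypothetical preference records; field names and values are assumptions
# made for illustration only.
votes = [
    {"model_a": "model-x", "model_b": "model-y", "preferred": "a"},
    {"model_a": "model-x", "model_b": "model-z", "preferred": "b"},
    {"model_a": "model-y", "model_b": "model-x", "preferred": "b"},
    {"model_a": "model-y", "model_b": "model-z", "preferred": None},  # no vote
]


def win_rates(votes):
    """Return the fraction of decided comparisons each model won."""
    wins = Counter()
    games = Counter()
    for vote in votes:
        if vote["preferred"] not in ("a", "b"):
            continue  # ignore comparisons without a clear preference
        winner = vote["model_a"] if vote["preferred"] == "a" else vote["model_b"]
        games[vote["model_a"]] += 1
        games[vote["model_b"]] += 1
        wins[winner] += 1
    return {model: wins[model] / games[model] for model in games}


rates = win_rates(votes)
for model, rate in sorted(rates.items()):
    print(f"{model}: {rate:.2f}")
```

Raw win rates ignore the strength of the opponent in each pairing; rating methods such as Bradley-Terry or Elo are common alternatives when the pairings are unbalanced.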

Metadata