Skip to content

Clarification on CSV Format Needed to Build a Converter for SenseMaker Integration #102

@bestian

Description

@bestian

Hello Red-Dwarf team,

I’m currently working on the development of SenseMaker, which is an opinion aggregator. It can analyze the comments portion of a Polis report (CSV format) and generate a human-readable summary from a large number of opinions.

Since we’re in discussion with the Agora-Citizen platform about integration, their CTO [Nicolas[(https://github.com/nicobao) mentioned that the core engine is Red-Dwarf, and suggested I reach out here to discuss the alignment of exported data.

Because CSV parsing requires matching specific column names, and I have made a converter that can transform Pol.is exported data into sensemaker's input format. I'm not sure if that converter can parse Red-Dwarf's exported data too. I’d like to ask:

Does the comments CSV format exported by Red-Dwarf use the same column names as Pol.is, or are there some differences?

If there are differences, I can write another converter to handle parsing, but I first need to know Red-Dwarf’s CSV format for comments before I can proceed.

Here is an example of Pol.is exported data (truncated from real data, with comment-id, agrees,disagrees,moderated,comment-body):

timestamp,datetime,comment-id,author-id,agrees,disagrees,moderated,comment-body
1681230375,Tue Apr 11 2023 16:26:15 GMT+0000 (Coordinated Universal Time),355,0,3,2,1,"Generative AI might bring about Athenian-like leisure time."
1680530603,Mon Apr 03 2023 14:03:23 GMT+0000 (Coordinated Universal Time),273,161,1,0,-1,"h"
1680124112,Wed Mar 29 2023 21:08:32 GMT+0000 (Coordinated Universal Time),0,0,5,17,1,"I'm excited about the possibilities of generative AI for targeting and mobilizing voters."
1680124115,Wed Mar 29 2023 21:08:35 GMT+0000 (Coordinated Universal Time),1,0,14,34,1,"If AI leads to economic growth and prosperity, that makes it good for democracy."

Here's converted version(fit sensemaker's input format, with comment-id, agrees, disagrees, passes, votes, comment_text):

timestamp,datetime,comment-id,author-id,agrees,disagrees,passes,votes,moderated,comment_text
1681230375,Tue Apr 11 2023 16:26:15 GMT+0000 (Coordinated Universal Time),355,0,3,2,0,5,1,Generative AI might bring about Athenian-like leisure time.
1680530603,Mon Apr 03 2023 14:03:23 GMT+0000 (Coordinated Universal Time),273,161,1,0,0,1,-1,h
1680124112,Wed Mar 29 2023 21:08:32 GMT+0000 (Coordinated Universal Time),0,0,5,17,0,22,1,I'm excited about the possibilities of generative AI for targeting and mobilizing voters.
1680124115,Wed Mar 29 2023 21:08:35 GMT+0000 (Coordinated Universal Time),1,0,14,34,0,48,1,"If AI leads to economic growth and prosperity, that makes it good for democracy."

If convenient, could you please paste a short sample of Red-Dwarf’s exported comments CSV here in this issue? That would help me figure out how to align the formats.

Thanks a lot!

Bestian

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions