summary.txt encoding incorrect #36

Open
opened 5 months ago by jamesapyrich · 2 comments
Collaborator

had a recent "summary.txt" that contained an accented "e" character:

I had a deep conversation with a caller who opened up about concerns regarding their brother's engagement, particularly due to the presence of abuse within his fiancée's family.

Showed up in the media manager as:

I had a deep conversation with a caller who opened up about concerns regarding their brother's engagement, particularly due to the presence of abuse within his fianc�e's family.

This is a text encoding issue, most likely the default Windows codepage being used when the file is in UTF-8. I'm not 100% certain how to resolve this generally, but my automation is outputting the text in UTF-8, and that's very often used in web pages as well, so the automation functions that read from text files should be using UTF-8 as well.

had a recent "summary.txt" that contained an accented "e" character: ``` I had a deep conversation with a caller who opened up about concerns regarding their brother's engagement, particularly due to the presence of abuse within his fiancée's family. ``` Showed up in the media manager as: ``` I had a deep conversation with a caller who opened up about concerns regarding their brother's engagement, particularly due to the presence of abuse within his fianc�e's family. ``` This is a text encoding issue, most likely the default Windows codepage being used when the file is in UTF-8. I'm not 100% certain how to resolve this generally, but my automation is outputting the text in UTF-8, and that's very often used in web pages as well, so the automation functions that read from text files should be using UTF-8 as well.
yuriy commented 5 months ago
Owner

I think this is an issue with how Selenium handles the encoding when sending it to the page, i'll have to rewrite that input to use javascript through Selenium instead.

This is also the reason that emoji's in Video Titles and descriptions cause issues for some sites.

I think this is an issue with how Selenium handles the encoding when sending it to the page, i'll have to rewrite that input to use javascript through Selenium instead. This is also the reason that emoji's in Video Titles and descriptions cause issues for some sites.
yuriy added the
Priority-B
label 5 months ago
yuriy commented 3 months ago
Owner

I wasn't able to replicate this, so i added the javascript formatted summary text to the errorlog. Next time you catch this happening we can hopefully narrow down the cause a bit.

I wasn't able to replicate this, so i added the javascript formatted summary text to the errorlog. Next time you catch this happening we can hopefully narrow down the cause a bit.
Sign in to join this conversation.
No Milestone
No project
No Assignees
2 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: yuriy/video-uploader#36
Loading…
There is no content yet.