Dealing with character sets

Clinton Jones Experian Elite
edited December 2023 in General

Sometimes the file that you want to use with Data Studio contains characters that would come into Data Studio incorrectly. Consider this file, countries.csv:

Note that in Preview and configure, the letter Å (å in lower case) doesn't show up correctly. This letter, an A with an overring, is common in the Swedish, Danish, Norwegian, Finnish, North Frisian, Walloon, Chamorro, Lule Sami, Skolt Sami, Southern Sami, and Greenlandic alphabets, and is also found in the Alemannic and Austro-Bavarian dialects of German.

Many other letters from the Latin alphabet behave in a similar fashion: ligatures and phonetic letters may not render correctly, or may not be correctly identified when files are auto-parsed.

Often this is due to misidentification of the character set. You can fix this in Preview and configure by changing the character set to something more appropriate.
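To see why a misidentified character set garbles a letter like Å, here is a minimal Python sketch run outside Data Studio. It assumes, purely for illustration, that the file was saved as Windows-1252 and was then read as UTF-8; the post does not say which character set Data Studio actually picked, and the sample text is made up.

```python
# Illustrative only: the same bytes read under two different character sets.
text = "Åland"

# Assume the file was written as Windows-1252 ("cp1252" is Python's codec name).
raw = text.encode("cp1252")  # Å is stored as the single byte 0xC5

# Reading those bytes as UTF-8 fails on 0xC5, so it is replaced by U+FFFD.
wrong = raw.decode("utf-8", errors="replace")

# Reading them with the character set they were written in restores the letter.
right = raw.decode("cp1252")

print(repr(wrong))
print(repr(right))
```

The bytes in the file never change; only the character set used to interpret them does, which is why switching the setting is enough to correct the display.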

A common choice is Windows-1252.

The result is then a more correct view of the underlying data.
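If you are not sure which character set to pick, one rough check outside Data Studio is to see which candidates can decode the file's bytes without errors. The sketch below is only an assumption about how you might do that in Python: the helper name decodable_charsets and the candidate list are hypothetical, and the sample bytes stand in for a real file.

```python
def decodable_charsets(raw, candidates=("utf-8", "cp1252", "iso-8859-1")):
    """Return the candidate character sets that decode raw without errors."""
    ok = []
    for name in candidates:
        try:
            raw.decode(name)  # strict decoding raises on invalid byte sequences
        except UnicodeDecodeError:
            continue
        ok.append(name)
    return ok


# Stand-in for the bytes of a file that was saved as Windows-1252.
sample = "Åland,AX\n".encode("cp1252")
results = decodable_charsets(sample)
print(results)
```

For a real file you could pass in pathlib.Path("countries.csv").read_bytes() instead of the sample. A clean decode only rules character sets out: several candidates can decode the same bytes without error and still show different letters, so check the preview by eye as well.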


