I downloaded a dataset of Facebook messages, and the text in it is formatted like this:
f\u00c3\u00b8rste student
It's supposed to be første student
but I can't seem to decode it correctly.
I tried:
text = 'f\u00c3\u00b8rste student'
print(text)
# fÃ¸rste student
text = 'f\u00c3\u00b8rste student'
print(text.encode('utf-8'))
# b'f\xc3\x83\xc2\xb8rste student'
But it didn't work.
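For reference, here is a minimal sketch of the round trip that seems to be needed. It assumes that each `\u00XX` escape in the export is really a single UTF-8 byte that was mis-read as a Latin-1 character. The variable names are only illustrative.

```python
# Assumption: the export stored every UTF-8 byte as a separate
# Latin-1 code point, so 'ø' (bytes C3 B8) became '\u00c3\u00b8'.
raw = 'f\u00c3\u00b8rste student'

# Latin-1 maps code points 0-255 one-to-one onto bytes, so encoding
# with it recovers the original UTF-8 byte sequence.
original_bytes = raw.encode('latin-1')

# Decoding those bytes as UTF-8 yields the intended text.
fixed = original_bytes.decode('utf-8')
print(fixed)  # første student
```

If the assumption does not hold, this fails rather than silently producing wrong text: `encode('latin-1')` raises `UnicodeEncodeError` when the string contains characters above U+00FF, and `decode('utf-8')` raises `UnicodeDecodeError` when the recovered bytes are not valid UTF-8.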