There has been some "work" on multi-byte encodings lately. I wouldn't be surprised if either the XML input contains invalid multi-byte sequences, or there is a bug in the multi-byte processing, or both. I am seeing invalid UTF-8 sequences in EPG quite frequently, so that is surely a part of the problem.
Problem is, the EPG is gathered from many different sources, and not everybody adheres to the same standard.
We can correct most errors (if not all). But for this to happen, we need at least one channel that has the problem, so that we can locate the website (and grabber) that needs adjusting.
At the moment, I cannot even identify the channel that causes the problem, let alone correct the grabber for it. And when you do not know what you are looking for, making changes is a gamble.
Moreover, the mipsel variants of enigma2 do not have a problem with this. And since I have only mipsel receivers, I cannot even test this.
We will always try to get it right, but it is not easy if we do not even know where to look.
So Eric, why not report the invalid UTF-8 sequences when you see them? (Look up the service ref of the channel, because this helps me to identify the exact channel and its attached EPG.)
Also make a screenshot of the error, which helps me find it. (Trying to find a single occurrence in a 30 MB file is not easy.)
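For anyone who wants to help narrow it down: a rough Python sketch like the one below could list where invalid UTF-8 sequences sit in a downloaded EPG file. This is only an illustration and not part of enigma2 or any grabber; the function name `find_invalid_utf8` and the idea of running it on a local copy of the XML are my own assumptions. It reads the file as raw bytes, tries to decode each line, and records the line number, the byte offset within the line, the offending bytes, and a bit of surrounding text.

```python
import sys


def find_invalid_utf8(data: bytes, context: int = 20):
    """Return a list of (line_no, byte_offset, bad_bytes, snippet) tuples.

    line_no is 1-based, byte_offset is 0-based within the line.
    Hypothetical helper for illustration only.
    """
    problems = []
    # Splitting on the newline byte is safe: 0x0A never occurs inside
    # a valid multi-byte UTF-8 sequence.
    for line_no, line in enumerate(data.split(b"\n"), start=1):
        pos = 0
        while pos < len(line):
            try:
                line[pos:].decode("utf-8")
                break  # the rest of this line is valid
            except UnicodeDecodeError as err:
                start = pos + err.start
                end = pos + err.end
                # Surrounding text, with bad bytes shown as U+FFFD.
                snippet = line[max(0, start - context):end + context]
                problems.append(
                    (line_no, start, line[start:end],
                     snippet.decode("utf-8", errors="replace"))
                )
                pos = end  # continue scanning after the bad bytes
    return problems


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python find_invalid_utf8.py <path to a local EPG XML file>
    with open(sys.argv[1], "rb") as handle:
        for line_no, col, bad, snippet in find_invalid_utf8(handle.read()):
            print(f"line {line_no}, byte {col}: {bad!r} near {snippet!r}")
```

The snippet printed for each hit usually contains enough of the programme title or description to tell which channel it belongs to, which is exactly the information needed to trace the source.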
Willy
Edited by doglover, 6 September 2016 - 10:22.