Jump to content


Photo

EPG data distribution format


  • Please log in to reply
93 replies to this topic

#1 WanWizard

  • PLi® Core member
  • 70,554 posts

+1,813
Excellent

Posted 21 July 2017 - 13:13

Willy,

 

Would you mind contacting the admin's of all XMLTV EPG mirrors, and ask how much data they (roughtly) process in a month?

 

Over June 2017, my server has send out 18.8TB of EPG data, and that is with over a third of the requests denied (requests from Kodi and other non-enigma platforms).

 

This seems to become serious business, and I think if we want to be able to sustain this growth, we need to think about a more efficient data distribution system, instead of sending large files with repeated and largely unneeded information on every request. If we do that, we probably also have to discuss this with developers from CrossEPG, and maybe other enigma tools out there.

 

Or do you also want to cater for non-Enigma platforms like Kodi (in which case we have to discuss any new format with them as well).

 

Let me know your thoughts on this.


Currently in use: VU+ Duo 4K (2xFBC S2), VU+ Solo 4K (1xFBC S2), uClan Usytm 4K Ultimate (S2+T2), Octagon SF8008 (S2+T2), Zgemma H9.2H (S2+T2)

Due to my bad health, I will not be very active at times and may be slow to respond. I will not read the forum or PM on a regular basis.

Many answers to your question can be found in our new and improved wiki.


Re: EPG data distribution format #2 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 15:02

I can produce smaller files with less info.  The number of files will however grow.  Problem is how to split up the different files.

And it will be a nightmare to go through the transition.  The grab files have to be manually sorted.

Also lets take the example of the French file:

 

This file contains the French channels and the Belgian channels from the French speaking part of the country.  If I split this out, everybody which want the Belgian channels which are in the Canalsat package have to download the Belgian package as well.  Etc. etc..

 

One thing I can do straight away is produce a news file.  This section is included in almost every other file, and thus make these file larger.  Then I could remove these files from all the other files.

But probably everybody will select the news section, and so downloading it.  The result would be zero, except more requests.

 

However this will not help. Due to the Kodi and IPTV users which have something like 2000 channels in their bouquet and want EPG for all the channels.

 

In the ideal world, I would like only to provide the EPG for enigma2 receivers.  But I do not know how to achieve that.

 

Willy


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #3 WanWizard

  • PLi® Core member
  • 70,554 posts

+1,813
Excellent

Posted 21 July 2017 - 15:10

It's not something that has to happen now, but something that has to be considered.

 

Alternatively we create a new importer, and use your existing data to convert into a different file format for OpenPLi users. In that case nothing will change for you, and for non-OpenPLi users, or users that don't use our importer (but use Cross for example).

 

This is probably the most transparent for you, and for the users, as there is no transition period for them.


Currently in use: VU+ Duo 4K (2xFBC S2), VU+ Solo 4K (1xFBC S2), uClan Usytm 4K Ultimate (S2+T2), Octagon SF8008 (S2+T2), Zgemma H9.2H (S2+T2)

Due to my bad health, I will not be very active at times and may be slow to respond. I will not read the forum or PM on a regular basis.

Many answers to your question can be found in our new and improved wiki.


Re: EPG data distribution format #4 WanWizard

  • PLi® Core member
  • 70,554 posts

+1,813
Excellent

Posted 21 July 2017 - 15:13

In the ideal world, I would like only to provide the EPG for enigma2 receivers.  But I do not know how to achieve that.

 

Difficult.

 

We thought about password protect the data, so you have to use the xmltvimporter and enter your username and password in the config. But as the code is open source, everyone can implement this feature, so it doesn't archieve anything.


Currently in use: VU+ Duo 4K (2xFBC S2), VU+ Solo 4K (1xFBC S2), uClan Usytm 4K Ultimate (S2+T2), Octagon SF8008 (S2+T2), Zgemma H9.2H (S2+T2)

Due to my bad health, I will not be very active at times and may be slow to respond. I will not read the forum or PM on a regular basis.

Many answers to your question can be found in our new and improved wiki.


Re: EPG data distribution format #5 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 15:30

I could compress the files in xz format.

 

File sizes:

 

Benelux file: gz:5751 kb xz:3444kb

UK file: gz:6362 kb xz:3173 kb

 

There is a 40 - 50 % savings in traffic.

 

Willy


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #6 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 15:44

The smallest files I could achieve with compression is with

 

xz - Ultra compression  (Extreme in linux version)

an almost the smae file size with

7x - Ultra compression

 

f.i the Benelux file became: 3216kb

 

Willy


Edited by doglover, 21 July 2017 - 15:57.

~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #7 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 16:03

In each case the importer does not understand the xz format:

C:\EPG\python>C:\EPG\python\OfflineImport.py  D:\EPG\python\test.sources.xml
[EPGImport] oudeis patch not detected, using epg.dat instead.
[EPGImport] failed to load C implementation, sorry
[EPGImport] nextImport, source= Test Benelux
[EPGImport] afterDownload V:\epg\BExmltv.xml.xz
[EPGImport] afterChannelDownload None
[EPGImport] Parsing channels from 'V:\epg\rytec.channels.xml.xz'
Traceback (most recent call last):
  File "C:\EPG\python\OfflineImport.py", line 68, in <module>
    importFrom(epgimport, xml)
  File "C:\EPG\python\OfflineImport.py", line 54, in importFrom
    epgimport.beginImport(longDescUntil = time.time() + (5*24*3600))
  File "C:\EPG\python\EPGImport.py", line 111, in beginImport
    self.nextImport()
  File "C:\EPG\python\EPGImport.py", line 120, in nextImport
    self.fetchUrl(self.source.url)
  File "C:\EPG\python\EPGImport.py", line 126, in fetchUrl
    self.afterDownload(None, filename, deleteFile=False)
  File "C:\EPG\python\EPGImport.py", line 191, in afterDownload
    self.afterChannelDownload(None, None)
  File "C:\EPG\python\EPGImport.py", line 210, in afterChannelDownload
    self.iterator = self.createIterator(filename)
  File "C:\EPG\python\EPGImport.py", line 129, in createIterator
    self.source.channels.update(self.channelFilter, filename)
  File "C:\EPG\python\EPGConfig.py", line 77, in update
    self.parse(filterCallback, self.urls[0])
  File "C:\EPG\python\EPGConfig.py", line 58, in parse
    for event, elem in iterparse(self.openStream(downloadedFile)):
  File "<string>", line 91, in next
cElementTree.ParseError: not well-formed (invalid token): line 1, column 0

Willy


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #8 Abu Baniaz

  • PLi® Contributor
  • 2,501 posts

+64
Good

Posted 21 July 2017 - 16:44

I dont see why you have Virgin and Freesat EPG.

Re: EPG data distribution format #9 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 17:34

I dont see why you have Virgin and Freesat EPG.

This is a bad example.

They all use the same XMLTV (data) file.  In this case only the channels file (links between the actual channel and the EPG) is different.

Willy

 

In the past I had to split off the Virgin channels, as some references resembled too much references present on the sat.  And in this case the crossepg complained too much.


Edited by doglover, 21 July 2017 - 17:35.

~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #10 littlesat

  • PLi® Core member
  • 57,198 posts

+700
Excellent

Posted 21 July 2017 - 17:47

Why not distribute it over a torrent system? Or usenet?

WaveFrontier 28.2E | 23.5E | 19.2E | 16E | 13E | 10/9E | 7E | 5E | 1W | 4/5W | 15W


Re: EPG data distribution format #11 Lost in Space

  • Senior Member
  • 876 posts

+69
Good

Posted 21 July 2017 - 18:22

Blockchain it - then you are in full control :D



Re: EPG data distribution format #12 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 21 July 2017 - 19:22

Blockchain it?

you have lost me here. Care to explain?

Willy

~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #13 WanWizard

  • PLi® Core member
  • 70,554 posts

+1,813
Excellent

Posted 21 July 2017 - 21:21

A funny note, without any useful contribution, as usual.

 

There is no reason to use a blockchain for EPG data, and I don't see a reason for using a torrent system either (that will use part of the end-users upstream, and some don't have a lot).

 

I'll have a good think about it.


Currently in use: VU+ Duo 4K (2xFBC S2), VU+ Solo 4K (1xFBC S2), uClan Usytm 4K Ultimate (S2+T2), Octagon SF8008 (S2+T2), Zgemma H9.2H (S2+T2)

Due to my bad health, I will not be very active at times and may be slow to respond. I will not read the forum or PM on a regular basis.

Many answers to your question can be found in our new and improved wiki.


Re: EPG data distribution format #14 Abu Baniaz

  • PLi® Contributor
  • 2,501 posts

+64
Good

Posted 21 July 2017 - 23:05

 

I dont see why you have Virgin and Freesat EPG.

This is a bad example.

They all use the same XMLTV (data) file.  In this case only the channels file (links between the actual channel and the EPG) is different.

Willy

 

In the past I had to split off the Virgin channels, as some references resembled too much references present on the sat.  And in this case the crossepg complained too much.

 

Sorry, I did not phrase that properly.

 

The EPG data is on the signal stream. People will start using it if they cant get the EPG using Rytec.



Re: EPG data distribution format #15 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 22 July 2017 - 07:17

A dutch mirror reports this:

 

Ik houd tegenwoordig zelf niet echt statistieken bij en mijn hosting provider ook niet, maar de netwerk interface geeft aan dat er in de 73 dagen uptime zo'n 22,9 Terabyte is verstuurd. Dus grofweg 9,5TB in een maand, ongeveer de helft van zijn server.

 

Kleinere bestanden helpt alleen als de clients niet doorgaan met het opnieuw downloaden voor de dagen die ze al eens hebben verwerkt. Dus als ze 7 dagen vooruit zouden downloaden, wordt de 7e dag in principe nog steeds 7x keer gedownload. Dan zouden de clients en de data slimmer moeten worden waarbij bijvoorbeeld alleen een dag opnieuw zouden downloaden als er daadwerkelijk iets veranderd (of nog beter zou zijn dat ze alleen de verschillen krijgen).

 

Comment:

 

Splitting the files in daily chunks, would not help.   Since each day the daily files are checked for changes.  And there are always changes.  So each day all daily files are changed, and even if the importer is so smart only to download changed files, it will still download the full list, since all files are changed every day.

 

Willy


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #16 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 22 July 2017 - 15:03

French Mirror:

 

 

From a quick eye I can see that my server has around 60-80 TB of traffic / month.
Attached you will find the screenshot of the last 24hrs.

With an average rate of 200-250 Mbps the results are 65-80 TB / month.
Is OK for me as I have unlimited plan on my dedicated servers located in OVH France.
My server has 1Gbps interface with 500Mbps guaranteed bandwidth,
The true is that is a lot of data but it dosn't bothers me a lot as things goes smooth and users are happy having EPG's.

As for your idea to find another way to distribute EPG's instead of sending those large files, a possible solution is a database where the EPG's will be stored and the users will download epg's only for the requested channels. If you have a MAG device (mag 250, mag 254, etc) you will see that uses this method. If OpenPLI developers can simulate this, then this might be the solution.

 

 

Willy

 

As you can see in the screenshot there are a lot of people who do not read forums.  Otherwise they would know that downloading from midnight until let's say 08h00 will only yield yesterday's file.

Attached Files


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #17 MiLo

  • PLi® Core member
  • 14,055 posts

+298
Excellent

Posted 22 July 2017 - 15:32

In each case the importer does not understand the xz format:


We could teach it that though. Either using the lzma backport Python module, or just spawning a subprocess and passing the data through busybox' xz decompressor.

(In case you hadn't noticed, lzma and xz are basically the same thing)
Real musicians never die - they just decompose

Re: EPG data distribution format #18 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 22 July 2017 - 16:36

 

In each case the importer does not understand the xz format:


We could teach it that though. Either using the lzma backport Python module, or just spawning a subprocess and passing the data through busybox' xz decompressor.

(In case you hadn't noticed, lzma and xz are basically the same thing)

 

 

If you can do so, it would be appreciated.  It is not a miracle thing, but it helps.

Pls. leave the existing import of gz and xml files intact.

 

Willy


Edited by doglover, 22 July 2017 - 17:17.

~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*


Re: EPG data distribution format #19 MiLo

  • PLi® Core member
  • 14,055 posts

+298
Excellent

Posted 22 July 2017 - 18:58

Did a quick implementation and added a recipe to the "develop" branch. If you could test it that would be nice...
Real musicians never die - they just decompose

Re: EPG data distribution format #20 doglover

  • Rytec EPG Team
  • 17,409 posts

+658
Excellent

Posted 23 July 2017 - 08:48

How do I install it?

Sorry about that.  I can compile on a linux machine, but I never know what I am getting.

 

But I see on the Github it is just a python file you have changed.

Question is it enough I replace the python file on my receiver?  Or do I need to install something more?

 

Willy


~~Rytec Team~~
Maxytec Multibox SE OpenPli (used as mediaplayer)
Mutant HD2400 OpenPli
Vu+ Duo OpenPli (backup)

Synology NAS

Sat: 13E, 19.2E, 23.5E and 28.2E
*Pli/Rytec EPG POWERED*



2 user(s) are reading this topic

0 members, 2 guests, 0 anonymous users