This repository has been archived by the owner on Aug 25, 2020. It is now read-only.

Scraper for the website of EPTC (Porto Alegre) that stores the site's bus schedules in a SQLite database


betoharres/EPTCScrapper


Usage

  1. Install the gems required by the scripts (scrap.rb, models.rb, EPTCBus.rb).

  2. Run: $ ruby scrap.rb

The script writes its results to a SQLite .db file.

NOTES:

  • One URL is requested at a time; otherwise EPTC's website will go down.
  • The entire process takes about 40 minutes.
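
The one-URL-at-a-time constraint can be sketched roughly as follows. This is not the code from scrap.rb; the method name `fetch_sequentially`, the `delay` parameter, and the injectable `fetcher` are hypothetical and only illustrate strictly sequential requests with a pause between them.

```ruby
require "net/http"
require "uri"

# Hypothetical helper: fetch each URL strictly one after another.
# `delay` (seconds between requests) is an assumed value, not taken from the scripts.
# `fetcher` defaults to a plain GET and can be swapped out, e.g. for testing.
def fetch_sequentially(urls, delay: 1.0, fetcher: ->(url) { Net::HTTP.get(URI(url)) })
  urls.each_with_index.map do |url, index|
    # Pause before every request except the first one.
    sleep(delay) if index.positive? && delay.positive?
    [url, fetcher.call(url)]
  end.to_h
end
```

The result is a hash from URL to response body. No threads or concurrent connections are used, so the server never sees more than one request from the script at once.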

Bus example:

sqlite> select * from buses where id=1;
        id = 1
identifier = 256-44
      name = INTENDENTE AZEVEDO (BACIA PÚBLICA)
      code = 2564
       url = http://www.eptc.com.br/EPTC_Itinerarios/Cadastro.asp?Linha=256-44&Tipo=TH&Veiculo=1&Sentido=0&Logradouro=0&Action=Tabela
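
A minimal sketch of how a URL like the one above could be assembled from a bus identifier. The query parameter values are copied from the example row; that other lines use the same values is an assumption, and `schedule_url` and `EPTC_BASE` are hypothetical names, not taken from the scripts.

```ruby
require "uri"

# Endpoint copied from the example row above.
EPTC_BASE = "http://www.eptc.com.br/EPTC_Itinerarios/Cadastro.asp"

# Hypothetical helper: only `Linha` varies; the remaining parameters
# are assumed to stay the same as in the example row.
def schedule_url(identifier)
  query = URI.encode_www_form(
    "Linha" => identifier, "Tipo" => "TH", "Veiculo" => 1,
    "Sentido" => 0, "Logradouro" => 0, "Action" => "Tabela"
  )
  "#{EPTC_BASE}?#{query}"
end
```

Calling `schedule_url("256-44")` reproduces the `url` value shown in the example row.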

Schedule example:

sqlite> select * from schedules where id=1;
           id = 1
    direction = 2
     day_type = 1
         time = 06:48
stop_datetime = 2020-01-01 06:48:00
  is_handicap = 1

direction_types:

  unknown: 0         , circular: 1        ,
  bairro_centro: 2   , centro_bairro: 3   ,
  bairro_terminal: 4 , terminal_bairro: 5 ,
  norte_sul: 6       , sul_norte: 7       ,
  norte_leste: 8     , leste_norte: 9     ,
  leste_sul: 10      , sul_leste: 11      ,

day_types:

 unknown: 0, mon_fri: 1, saturday: 2, sunday: 3
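
To read a schedules row, the integer columns can be mapped back to the names listed above. The sketch below restates both tables as Ruby hashes; `describe_schedule` is a hypothetical helper, and how models.rb actually declares these values is not shown here.

```ruby
# Values copied from the direction_types and day_types lists above.
DIRECTION_TYPES = {
  unknown: 0, circular: 1,
  bairro_centro: 2, centro_bairro: 3,
  bairro_terminal: 4, terminal_bairro: 5,
  norte_sul: 6, sul_norte: 7,
  norte_leste: 8, leste_norte: 9,
  leste_sul: 10, sul_leste: 11
}.freeze

DAY_TYPES = { unknown: 0, mon_fri: 1, saturday: 2, sunday: 3 }.freeze

# Hypothetical helper: translate the stored integers into their symbolic names.
# Integers that are not in the tables come back as nil.
def describe_schedule(direction:, day_type:)
  {
    direction: DIRECTION_TYPES.key(direction),
    day_type: DAY_TYPES.key(day_type)
  }
end
```

For the schedule example above (direction = 2, day_type = 1), `describe_schedule(direction: 2, day_type: 1)` yields `:bairro_centro` and `:mon_fri`.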

Bonus (how many bus schedules support wheelchairs):

sqlite> select count(*) from schedules where is_handicap=1;
count(*) = 16266
sqlite> select count(*) from schedules where is_handicap=0;
count(*) = 25310
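
As a quick worked calculation, the two counts above give the share of wheelchair-accessible schedules. The variable names are illustrative only.

```ruby
# Counts copied from the two queries above.
accessible = 16_266
not_accessible = 25_310

total = accessible + not_accessible
# Percentage of schedules with is_handicap = 1, rounded to one decimal place.
share = (accessible * 100.0 / total).round(1)

puts "#{accessible} of #{total} schedules (#{share}%) support wheelchairs"
```

That is 16266 of 41576 schedules, or roughly 39%.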
