Launch of Czech WTT site moved to half of May
Considering the current state of development of the Czech WTT clone it was decided to move the launch of the application about three weeks later than originally planned – from the end of April to half of May. The main reason is more complicated data acqusition than expected. The MPs' contact information and membership information needed for WTT are scraped from official parliament website together with other MP data. Recently, while actual scraping of the data a number of wrong records was discovered on the official websites.
The wrong records were identified thanks to an automatic basic checking of the scraped information that we have implemented to assure quality of the data in our database. Such checking is aparently missing on the official parliament website. The most common errors are duplicate memberships of MPs in groups, swapped dates, or groups matched with wrong term of office of the parliament. The first discovered error was reported to the administrator of the parliament website and it was instantly corrected there. Based on this we plan to send a more extensive report on the identified errors in the following days and to help to improve the quality of the offcial data.
Besides, the international development branch of the WTT application itself (ancestor of the Lithuanian parasykjiems.lt) that serves as a base for Czech and Slovak version is pulled from the repository now and trying to get running on my local computer.