It is a pleasure working with Gera-IT on our projects. They have a very structured way of working which gives us insight in progress and cost of our project. We recommend working with Gera-IT team on your programming efforts!
Application provides ability for the site owners to scan their web-sites and gather such information:
Client side is built on Ruby on Rails framework, while for Processing part Java based frameworks (Solr and Nutch) are used. Their communications is built through API.
The architecture was designed in a scalable way, where the application can process sites with hundred thousands of pages and even more. Our team developed the application from scratch, both RoR and Java sides. This application is a good example of how two technologies can be combined in order to achieve best results in short terms.
Most interesting parts in the application from engineering point of view are: