Customised online search engine for legal documents

 
ActaPublica.se is Sweden’s largest document database and research company, with an archive of around 20 years’ worth of data and reports. It provides comprehensible information, surveys and compilations that enable in-depth analysis and strategy planning.

ActaPublica also provides real-time monitoring services that let customers keep track of products, people, companies or competing brands.

The platform uses Elasticsearch relevance ranking to show the best-matching documents at the top of the result set. Users can also integrate document search into other applications through the API.

The application core is built on the Laravel framework and uses Node.js for real-time communication.

Key features:

Wildcard search: The user can enter wildcard expressions to get more relevant results. The ‘*’ symbol matches any sequence of characters, and ‘?’ matches a single character in the keyword.
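The wildcard semantics described above can be illustrated with a small, self-contained Python sketch. It is not the platform's implementation (the search itself runs on Elasticsearch); the function names and sample titles are hypothetical, and case-insensitive matching is an assumption made for the example.

```python
import re


def wildcard_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a wildcard expression into a compiled regular expression."""
    parts = []
    for ch in pattern:
        if ch == "*":
            parts.append(".*")           # any sequence of characters, including none
        elif ch == "?":
            parts.append(".")            # exactly one character
        else:
            parts.append(re.escape(ch))  # every other character matches literally
    # Case-insensitive matching is an assumption of this sketch.
    return re.compile("".join(parts), re.IGNORECASE | re.DOTALL)


def wildcard_match(pattern: str, keyword: str) -> bool:
    """Return True when the whole keyword matches the wildcard expression."""
    return wildcard_to_regex(pattern).fullmatch(keyword) is not None


# Hypothetical sample data, for illustration only.
titles = ["Stockholm", "Stockholms tingsrätt", "Stockvik", "Stack"]
matches = [title for title in titles if wildcard_match("Stock*", title)]
print(matches)
```

Here ‘Stock*’ selects every title that begins with "Stock", while a pattern such as ‘St?ck’ would match "Stack" or "Stock" but not "Stck".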

Agent feature: The user can save search criteria of interest as an ‘agent’ so that the search does not have to be repeated. They are notified about new matching documents via real-time browser notifications and by email.

Application programming interface: Unique API access credentials can be used to access data from an account, to build custom services and solutions.

Live browser notifications: A real-time socket connection is used to push browser notifications to the user on finding new documents matching agent criteria.

Activity logs: Each stage of agent processing is logged in the backend, which keeps track of processing events and helps with auditing and debugging.

Request throttling: All API/download requests can be regulated at the user/organization level by the backend administration.
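As a rough illustration of per-user or per-organization throttling, the sketch below implements a sliding-window limiter in Python. The source does not say which throttling algorithm the backend uses, so the class name, the key format ("user:42", "org:7") and the limits are all assumptions for the example.

```python
import time
from collections import defaultdict, deque
from typing import Callable, Deque, Dict


class SlidingWindowThrottle:
    """Allow at most `limit` requests per key within the last `window_seconds`."""

    def __init__(self, limit: int, window_seconds: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window = window_seconds
        self.clock = clock  # injectable so the behaviour can be tested without sleeping
        self._hits: Dict[str, Deque[float]] = defaultdict(deque)

    def allow(self, key: str) -> bool:
        """Record a request for `key` and report whether it is within the limit."""
        now = self.clock()
        hits = self._hits[key]
        # Forget requests that have fallen out of the window.
        while hits and now - hits[0] >= self.window:
            hits.popleft()
        if len(hits) >= self.limit:
            return False  # over the limit: the caller should reject the request
        hits.append(now)
        return True


# Hypothetical usage: 100 API/download requests per minute for each key.
throttle = SlidingWindowThrottle(limit=100, window_seconds=60)
if throttle.allow("user:42"):
    print("request accepted")
```

Keying the limiter on a user or organization identifier is what lets limits be applied at either level; in a multi-server deployment the counters would need to live in a shared store rather than in process memory.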

Multilingual support: The application supports eight major languages – English, Danish, German, Spanish, French, Italian, Portuguese and Swedish. A user can easily switch between languages from the footer of the platform.

Technical information:

AWS services: The application runs on a high-performance EC2 instance to give end users a smooth experience. Documents are stored in AWS S3 and served from there when users request previews and downloads.

Real-time data broadcasting: Notification alerts are broadcast to users through a socket channel and delivered as browser notifications. This is accomplished by integrating Node.js, Socket.io and Redis in the system.
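The publish/subscribe flow behind this can be sketched with an in-memory stand-in. The Python example below does not use Redis or Socket.io; the Broker class, the channel name and the message fields are hypothetical and only illustrate how a backend event can fan out to the socket layer that pushes browser notifications.

```python
from collections import defaultdict
from typing import Callable, Dict, List

Handler = Callable[[dict], None]


class Broker:
    """Minimal in-memory publish/subscribe hub (stand-in for a message broker)."""

    def __init__(self) -> None:
        self._subscribers: Dict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, channel: str, handler: Handler) -> None:
        """Register a handler that is called for every message on the channel."""
        self._subscribers[channel].append(handler)

    def publish(self, channel: str, message: dict) -> int:
        """Deliver the message to all subscribers and return how many received it."""
        handlers = self._subscribers.get(channel, [])
        for handler in handlers:
            handler(message)
        return len(handlers)


broker = Broker()
delivered: List[dict] = []

# The socket layer would subscribe and forward each message to the user's browser;
# here the messages are simply collected in a list.
broker.subscribe("agent-notifications.user.42", delivered.append)

# The backend publishes when an agent finds a new matching document.
broker.publish("agent-notifications.user.42",
               {"agent": "court rulings", "document_id": 1001})
print(delivered)
```

Separating the publisher (the backend that processes agents) from the subscriber (the socket server) is what allows the two to run as independent processes that communicate only through channels.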

Elastic Cloud: The Elasticsearch store is hosted on Elastic Cloud, which lets us use the latest Elasticsearch version and gives access to features such as Elastic X-Pack, extensive monitoring and snapshots.

Database: The application’s MySQL database runs on Amazon RDS for better data security and easy backups. The system also uses MongoDB to store the agent activity logs.

Responsive design: The website is built on the Bootstrap framework to provide a responsive, optimized user experience on all devices and orientations.

Coding standards: The project follows the PSR-2 coding standard, with proper code comments and PHPDoc blocks, to keep the code readable and easy to maintain.

Future challenges

Improved UX – We will keep improving the user experience based on user feedback and industry developments, so that users can get the most out of the platform.

Elasticsearch – We plan to refactor our queries using newer Elasticsearch query features, for faster results and a better overall experience of the system.

Faster notifications – The aim is to further reduce the turnaround time for email notifications by using services such as AWS Lambda.

CI/CD – We are working towards making releases easier, faster and automated using Bitbucket Pipelines.

Improve code coverage – We will make better use of design patterns such as the repository pattern and improve unit-test coverage.

Better scalability – LiteBreeze experts plan to take full advantage of AWS services like EC2 autoscaling to improve the scalability of the application.

We trust LiteBreeze with all our web development project work in Siren and Acta Publica. The team developed the archive service for Acta Publica which all media companies in Sweden rely on daily. We are excited about ongoing projects for Siren and recommend LiteBreeze for their AWS expertise as well. - Martin Fredriksson
Team of developers who worked on this project: Preeth, Saji, Dileep, Arjun KB, Sahal