- In 5 months, Lighthouse migrated four databases—with 25 TBs of data—all while keeping the databases active for review and production for current matters.
- Leveraging our AI technology, Lighthouse created an innovative solution for a large volume of Lotus Notes files originally processed as HTML files by a legacy processing tool. This solution ensured that any new Lotus Notes files would deduplicate against the migrated data, regardless of the file type or the tool used for processing.
A Challenging Data Deduplication Problem
A large healthcare system had been hosting its data (over 25 TBs of data across four databases) on another vendor’s platform for nearly a decade. The company knew it was time to modernize its eDiscovery program with Lighthouse. In order to do so, all 25 TBs would need to be migrated over to Lighthouse for hosting and future processing. However, in addition to data migration, the company also had a unique deduplication challenge due to the previous vendor’s original processing tool.
The company’s data had originally been processed with the vendor’s legacy processing tool—which processed Lotus Notes data as HTML files, rather than the more modern EML version. The prior processing of these files into an HTML format meant that whenever duplicate Lotus Notes files were added to the database and processed using a more modern processing tool, those EML files would not deduplicate against the older HTML files in the databases.
With over half their data consisting of Lotus Note files processed by the older tool in HTML format, the company was concerned that this issue would significantly increase review cost and slow down review time.
Thus, in addition to the overall migration process, the company came to Lighthouse with an unfortunate Catch-22: in order to modernize its processing and eDiscovery capabilities, it was losing the ability to deduplicate a majority of its data with each new ingestion.
Lighthouse Migration Expertise
Because of the volume of new clients moving to Lighthouse for eDiscovery support, Lighthouse has developed an entire practice group dedicated to data migration. This group is adept at creating customized solutions to the unique challenges that often arise when migrating data out of legacy systems. The team works closely with each client to understand the scope, types of data, challenges, and future needs so that the data migration process is seamless and efficient.
The Lighthouse migration team quickly got to work gathering information from the healthcare company to start this process, paying particular attention to the Lotus Notes deduplication issue. Once all relevant information was gathered, Lighthouse worked with stakeholders from the organization to form a comprehensive migration plan that minimized workflow disruption and included a detailed schedule and workflow for future data. In the process, Lighthouse also developed a custom solution for the Lotus Notes issue using our proprietary AI technology.
An Innovative Solution: Lighthouse AI
Lighthouse’s advanced AI technology can create a unique hash value for all data, no matter how it was originally processed. The Lighthouse migration team leveraged this innovative technology to create a unique hash value for the Lotus Notes files that were originally processed as HTML files. That hash value could then be matched against any new Lotus Notes files that were added to the database by the company, even when those files were processed as EML files. With this proprietary workflow, the healthcare company was able to seamlessly move to Lighthouse’s eDiscovery platform, which was better equipped to serve its eDiscovery needs—without losing the ability to deduplicate its data.
Set Up for Success
In just five months, Lighthouse completed a seamless migration of the healthcare company’s data by creating a custom migration plan that minimized blackouts and kept all databases up and running.
Importantly, Lighthouse also leveraged its proprietary AI to create an innovative solution to a complex problem, ensuring continued deduplication capability and reduced discovery costs.