A global financial services institution needed assistance with an internal investigation involving the chat messaging activity of one of its traders. To complete this investigation, a large volume of Instant Bloomberg data — stored in Bloomberg Vault — needed to be reviewed. In fact, the targeted data collection resulted in 200 GB of chat message data from 11 custodians, contained in 140 XML files (with attachments), to be reviewed in Relativity. It was critical that all chat participants were accurately identified and all messages were correctly reassembled.
A substantial but unaddressed technical challenge posed by chat messaging software is the fact that chat data, unlike email, does not contain easily identifiable header information, such as To/From/Cc/Bcc/Subject, making it difficult to correctly and completely track all participants in the message, including passive viewers. However, the complete and accurate identification of all participants in every message was imperative to performing their investigation.
Lighthouse was brought on to support the financial services institution through this internal investigation. To process and review chat data, the Lighthouse team leveraged ChatSmart, a proprietary application specifically built to address the complexities of chat and mobile messaging data.
First, the Lighthouse consultants evaluated the data set and conferred with the client to discuss the contents of the chat data and the repositories from which it was collected. Using this information, the team configured the app to ingest and process all 200 GB of Bloomberg chat messaging quickly and accurately.
ChatSmart’s unparalleled message-level granularity parsed additional, nonstandard metadata at the pre-processing stage and identified more than 134.5 million unique messages. These messages spanned more than 67,000 chat sessions, many of which had remained open for several years running, with up to 5,000 viewers coming and going during that time.
Using ChatSmart’s ability to track message components to the millisecond, the messages were reassembled with contextual and chronological fidelity. This granularity proved critical for the review.
The team then used the app to initially reduce the review volume by deduplicating at the message level. The remaining messages were then assembled into transcripts and loaded into Relativity, along with attachments.
Once in Relativity, the team delivered reports and consulted on several data analysis visualizations. And using ChatSmart’s Relativity modular plugin for self-service dynamic searching and filtering, the review team was able to quickly filter out the ever-present noise in the data set. This included notifications of joining and leaving, and other system events no longer of interest.
ChatSmart quickly processed the data with the reliability and technical accuracy necessary to create a data set with which the review team could easily interact and understand.
From the original 134.5 million unique messages, ChatSmart’s message-level deduplication excluded more than 120.4 million duplicative messages for an initial data reduction of 89.5%, even before the data was placed into Relativity. This not only cut down on review volume, but it also provided massive savings in recurring hosted database charges.
The ChatSmart team provided data visualizations that included a timeline that showed key time frames in the data set, and a node analysis that graphically depicted the communication patterns of most interest. With this information, the review team was able to quickly develop an efficient strategy for their investigation.
Once the strategy was established, the review team used the ChatSmart interface in Relativity to first drill down to just the most relevant 3% of the review set. Then they were able to further identify and focus on the .01% of the total data set that contained critical private chats between key traders.
Due to ChatSmart’s unique parsing and deduplication methodology, and its intuitive and powerful dynamic search and filtering Relativity interface, the review of a matter with more than 134.5 million unique messages required only 71 reviewer-hours to comprehensively and confidently complete.
For 25 years, Lighthouse has provided innovative software and services to manage the increasingly complex landscape of enterprise data for compliance and legal teams. Lighthouse leads by developing proprietary technology that integrates with industry-leading third-party software, automating workflows, and creating an easy-to-use, end-to-end platform. Lighthouse also delivers unique proprietary applications and advisory services that are highly valuable for large, complex matters, and a new SaaS platform designed for in-house teams. Whether reacting to incidents like litigation or governmental investigations, or designing programs to proactively minimize the potential for future incidents, Lighthouse partners with multinational industry leaders, top global law firms, and the world’s leading software provider as a channel partner.