Hurdles and solutions
At MindTitan, we thrive on challenges, exploring the edge of innovation and crafting solutions that push boundaries. In our journey with Gratitude Service, we came across a number of hurdles that were unique and multifaceted, requiring an innovative approach. Through a series of strategic efforts, we were able to design solutions that ensured both the functionality and efficiency of our AI-driven system. Let’s delve into the challenges we encountered and the solutions we engineered.
Architecture challenges:
Our project architecture faced two significant challenges that required innovative solutions.
The first challenge centered around privacy concerns, primarily driven by the GDPR regulations and the presence of people in the cemeteries. Our priority was to ensure that the privacy rights of individuals were respected while we gathered and processed the data. To address this, we utilized specially designed AI models that could effectively anonymize images. In practice, this meant processing each image to remove or obscure identifiable features of any individuals present in the cemetery. This approach not only maintained privacy standards but also ensured our compliance with GDPR regulations. By leveraging these models, we managed to maintain the data integrity necessary for our operations without infringing on personal privacy.
The second challenge we faced was a slow interface, which was caused by the high-resolution image requests made by the AI model. High-resolution images, while offering more detail, also required more computational resources and time to process. This results in a sluggish interface, hampering the efficiency of the system. To tackle this automatically, we implemented scripts capable of scaling a set of images for labeling. Moreover, we converted the high-resolution images into a lower-quality format that, while still retaining essential details, allowed for faster loading and smoother operation. Through these modifications, we managed to streamline the interface while ensuring the system continued to provide accurate and meaningful results.
Data Challenges:
The complexity and unique features of cemetery landscapes presented us with a multitude of challenges that conventional, off-the-shelf, or synthetic data collection methods failed to meet adequately. These landscapes, marked by their intricate patterns and varying structures, generate a plethora of visual data that include multiple tricky edge cases – each requiring careful attention and analysis.
The sheer diversity and variability in tombstone designs, inscriptions, and placements, compounded by different lighting and weather conditions, make the data collection process intricate. It’s not just about getting any data; it’s about gathering high-quality, meaningful data that our AI can learn from to make accurate predictions.
Given these challenges, we have opted for a proprietary solution to collect the data needed for our operations. This solution, designed to handle the complexities inherent in our unique industry, involves a robust and comprehensive data pipeline that ensures the systematic collection, processing, and management of data.
Although implementing this solution requires significant effort and resources, the resultant data serves as a rich resource for training our AI models. These models then provide accurate and valuable insights about the cemetery landscapes and tombstone locations, driving our ambition to create the world’s largest cemetery database.
-
Labeling Tombstones in Each Camera Frame
The challenge of labeling tombstones in every frame of our camera recordings is quite a monumental task. The complexity of the cemetery landscapes’ diversity, described above, multiplies when you factor in the vast quantity of frames captured during each recording session. It is not just about identifying tombstones, but also about understanding their relative positions, inscriptions, and sometimes their historical contexts.
To address this, we’ve engineered a scalable model pipeline. This pipeline is adept at handling and processing a wide range of image quantities efficiently, ensuring that no tombstone goes unlabeled, regardless of the number of frames we capture.
This solution allows us to break down the colossal task of labeling into manageable chunks. As a result, our AI models can systematically learn and predict from a comprehensive and growing database, improving their ability to recognize and understand tombstones. The scalable nature of our pipeline ensures that, as we grow and capture more images, our system can handle the increased load while maintaining the same level of efficiency and accuracy.
One significant hurdle we encountered during our data collection and analysis process is dealing with unclear pictures of tombstones. It is a challenge to recognize engravings on images that are blurred or of poor quality due to camera movements or compression of the lighting. Pictures of tombstones which are centuries old, for example, from the eighteenth century, with their long history, tend to have engravings that are not clearly legible due to erosion and weathering over the years.
In order to effectively handle this challenge, we implemented Optical Character Recognition (OCR) solutions to improve the accuracy of text detection on tombstones. OCR is a technology that converts different types of documents, such as scanned paper documents, PDF files or images, captured by a digital camera, into editable and searchable text data. By using OCR technology, even faint and weathered engravings could often be translated into usable data.
However, OCR technology alone wasn’t enough to ensure the best quality data extraction, particularly with the level of complexity and variability presented by the old tombstones. To further refine the process, MindTitan also implemented text-cleaning scripts. These scripts worked to reduce any noise in the OCR detections, thereby enhancing the quality and accuracy of the extracted information. The result was a far cleaner, more consistent and accurate set of data derived from the engravings.
Despite our best efforts and advanced technology, it is important to understand and acknowledge the inherent limitations of the dataset. Some tombstones, due to extreme weathering or damage, may have inscriptions that are simply too faint or eroded to be accurately detected or deciphered even by humans standing in front of them. In such cases, our approach is to focus on maximizing the extraction of information from those tombstones that have readable inscriptions.