
Over the past few years, a range of interconnected developments has driven this shift—particularly within the medical domain. Some of the most significant include:
🔹 The widespread expansion of the internet and social media.
🔹 Remarkable advances in Artificial Intelligence (AI) and quantum computing.
🔹 Breakthroughs in biotechnology, genomics, and the broader “omics” sciences.
🔹 The growing adoption of telemedicine and wearable technologies.
🔹 The accelerated development of personalized medicine.
🔹 A significant increase in data storage and processing capabilities (one of Cromodata’s areas of expertise), enabling access to and utilization of large-scale datasets.
When combined with the rising costs and well-documented limitations of traditional clinical trials, real-world data emerges as a powerful resource to help bridge the gap between clinical research and real-world practice.
Some of the most valuable uses of RWD in healthcare are already well known. Recruiting patients for clinical trials, comparing the effectiveness of drugs and treatments, and monitoring their safety are among the most common in the pharmaceutical world.
Up next, we’ll highlight some of the newer and more promising use cases.
Real-World Data (RWD) comes from many sources, but the most common are: Electronic Health Records (EHRs), patient registries, administrative databases (including clinical records, insurance data, and billing information), disease registries, and databases on pharmaceutical products and medical devices.
RWD can be more or less accessible, structured, and refined for use — and that determines how quickly and easily statistical analysis and predictive models can be applied. The ultimate goal is to generate Real-World Evidence (RWE) that can be used to draw conclusions, validate hypotheses, design studies, support regulatory decisions, develop public health policies, or even guide clinical practice.
One of the biggest challenges today in Latin America — and in many parts of the world — is the high level of data fragmentation, which makes RWD hard to use. That’s where a promising niche is emerging: ensuring access to medical datasets that are both interoperable and secure.
This will be key from now on for most advances in medicine — from training AI and developing new drugs, to scientific discoveries, medical research, and precision medicine. It will also help address unmet medical needs, study hard-to-reach subpopulations, and evaluate the long-term safety and effectiveness of treatments.
In the figure below, you’ll see the most common and widely used sources of RWD today.
In recent years, several categories of RWD have gained particular relevance — among them laboratory and genomic data (with spatial genomics standing out), pharmaceutical data, oncology data and data on both prevalent and rare diseases, information on social determinants of health (SDOH), and records from specialty pharmacies. Health data provide a unique opportunity to deepen our understanding of rare diseases. In this regard, biopharmaceutical companies often face challenges in recruiting sufficiently large study populations.
“Omic” data — including genomic, epigenomic, microbiomic, pharmacogenomic, transcriptomic, proteomic, and metabolomic datasets, among others — offer transformative potential for both healthcare research and clinical application. Genomic data, in particular, are growing in prominence due to the increasing use of biomarker-targeted therapies, which are central to today’s precision and personalized medicine approaches. Spatial genomics — which merges genomic or transcriptomic sequencing with spatial localization techniques — is an emerging field within omics sciences. It shows strong market potential and valuable clinical applications, particularly in cancer, neuroscience, inflammation and autoimmune diseases, and embryonic development.
Pharmacy-related data are also highly valuable in the context of specialty medications, which currently represent approximately 75% of drugs in development.
Medical imaging data — comprising an estimated 90% of all healthcare data — are playing a central role in the development and validation of new artificial intelligence tools.
The increased availability of medical imaging within real-world data (RWD) in recent years has significantly fueled the development of machine learning (ML) algorithms aimed at enhancing diagnostic precision. Medical imaging represents a highly complex data type, yet one with tremendous potential for disease detection, diagnosis, and monitoring.
When properly used and analyzed, RWD holds the potential to generate valid and unbiased RWE — offering significant cost and time savings compared to controlled clinical trials— and to improve the efficiency of medical and health-related research and decision-making.
Today, there are three main challenges to advancing AI: algorithms, computing power, and data. While the first two have well-established markets supporting them, getting access to quality data for training AI remains a big challenge — and an even greater one in Latin America.
That’s exactly why we do what we do at CROMODATA.