Analytics in Action News

UCSF, Microsoft Develop Privacy-Protecting Data Analytics Tool

The data analytics platform will protect the privacy of clinical information to accelerate artificial intelligence in healthcare.

UCSF, Microsoft to develop privacy protecting data analytics tool

Source: Thinkstock

By Jessica Kent

- UC San Francisco’s Center for Digital Health Innovation (CDHI), Microsoft Azure, Fortanix, and Intel have partnered to design a privacy-preserving data analytics platform that will aim to advance healthcare artificial intelligence.

The platform will provide a zero-trust environment to protect both the intellectual property of an algorithm and the privacy of healthcare data, while CDHI’s proprietary BeeKeeperAI will provide the workflows to allow for more efficient data access, transformation, and orchestration across multiple data providers.

“Validation and security of AI algorithms is a major concern prior to their implementation into clinical practice. This has been an oftentimes insurmountable barrier to realizing the promise of scaling algorithms to maximize potential to detect disease, personalize treatment, and predict a patient’s response to their course of care,” said Rachael Callcut, MD, director of data science at CDHI and co-developer of the BeeKeeperAI solution.

“Bringing together these technologies creates an unprecedented opportunity to accelerate AI deployment in real-world settings.”

To gain regulatory approval for clinical AI algorithms, researchers need highly diverse and detailed clinical data to develop, optimize, and validate unbiased algorithm models. Algorithms used in the context of healthcare delivery have to be capable of consistently performing across diverse patient populations, socioeconomic groups, and geographic locations.

READ MORE: How Big Data Analytics Can Mitigate COVID-19 Health Disparities

Few research groups or healthcare organizations have access to enough high-quality data to accomplish these goals.

“While we have been very successful in creating clinical-grade AI algorithms that can safely operate at the point of care, such as immediately identifying life-threatening conditions on X-rays, the work was time consuming and expensive,” said Michael Blum, MD, associate vice chancellor for informatics, executive director of CDHI and professor of medicine at UCSF.

“Much of the cost and expense was driven by the data acquisition, preparation, and annotation activities. With this new technology, we expect to markedly reduce the time and cost, while also addressing data security concerns.”

Through the partnership, the organizations will leverage the confidential computing capabilities of Fortanix Confidential Computing Enclave Manager, Intel’s Software Guard Extensions (SGX) hardware-based security capabilities, Microsoft Azure’s confidential computing infrastructure, and UCSF’s BeeKeeperAI privacy preserving analytics to develop a proven clinical algorithm against a simulated dataset.

“Fortanix pioneered the use of Confidential Computing to secure sensitive data across millions of endpoints in industries such as financial services, defense, and manufacturing,” said Ambuj Kumar, CEO and co-founder of Fortanix.

READ MORE: How Big Data Analytics Models Can Impact Healthcare Decision-Making

“It is a privilege to work with UCSF and other technology innovators to use Confidential Computing to unlock the potential of healthcare data, and then create breakthroughs in clinical research that will help transform the healthcare industry and save lives.”

The team will use a clinical-grade algorithm that rapidly identifies those needing blood transfusion in the ED following trauma as a reference standard to compare validation results. Researchers will also test whether the model or the data were vulnerable to intrusion at any point.

Future phases will use HIPAA-protected data within the context of a federated environment, allowing algorithm researchers and developers to conduct multi-site validations. The ultimate goal is to support multi-site clinical trials that will accelerate the development of regulated AI solutions.

“Trusted execution environments enabled by Intel SGX could be key to accelerating multi-party analysis and algorithm training while helping to keep data protected and private. In addition, built-in hardware and software acceleration for AI on Intel Xeon processors enables researchers to stay on the leading edge of discovery,” said Anil Rao, vice president of data center security and systems architecture platform hardware engineering division at Intel.

“This collaboration with UCSF, Fortanix and Microsoft Azure demonstrates the amazing potential of confidential computing with Intel’s hardware-rooted protection defending the data.”

Confidential computing technology protects patient data privacy by enabling a specific algorithm to interact with a specifically curated dataset which remains in the control of the healthcare institution at all times via their Azure confidential cloud computing infrastructure.

The data will be placed into a secure enclave within Azure confidential computing, powered by Intel SGX and leveraging Fortanix cryptographic functions, including validating the signature of the algorithm’s image. The image will be processed in a separate enclave securely connected to another enclave holding the algorithm, ensuring multiple parties can leverage the system without needing to trust one another.

Through this partnership, the organizations will aim to develop secure, comprehensive data analytics platforms to advance innovative healthcare delivery.

“When researchers create innovative algorithms that can improve patient outcomes, we want them to be able to have cloud infrastructure they can count on to achieve this goal and protect the privacy of personal data,” said Scott Woodgate, senior director, Azure security and management at Microsoft Corp. “Microsoft is proud to be associated with such an important project and provide the Azure confidential computing infrastructure to healthcare organizations globally.”