2 Analyze Health Data
Data Science Service (DSS) on Oracle Cloud Infrastructure (OCI) allows you to build study cohorts, manipulate data, apply statistical models, and develop and validate machine learning pipelines.
Data Science Service enables researchers to generate statistical insights and evidence to help further their research.
Data scientists, biostatisticians with Python and SQL skills, and qualitative clinical researchers can use DSS to construct cohorts from the Real-World Data dataset based on criteria defined in study design, perform advanced statistical analysis, handle complex data, and perform machine learning modeling on RWD data.
Data Science Service is a platform for data scientists to build, train, deploy, and manage machine learning models using Python and other open source tools. A JupyterLab environment is provided to experiment and develop models. You also can scale up model training using NVIDIA GPUs and distributed training.
This section contains the following tasks:
- Analyze Data Using Data Science Service Notebooks
Use Data Science Service to perform statistical analysis and modeling to answer your research questions. At a high level, you need to set up a workspace in Data Science Service, prepare the data, conduct analysis, interpret the results, then share the findings. - Sign in to Data Science Service from OCI
Access Data Science Service from your organizations Oracle Cloud Infrastructure (OCI) console. - Create Data Science Service Projects
Create projects in Data Science Service to organize your work. Projects contain a collection of Data Science resources such as notebooks. When you sign in to Data Science Service, access the Projects page. - Create Notebook Sessions Using JupyterLab
Notebook sessions are interactive coding environments where you can build and train models. Oracle Life Sciences AI Data Platform uses Data Science Service from Oracle Cloud Infrastructure. - Configure Data Science Service Notebooks Using PySpark
The Python3 conda environment is preinstalled in the notebook session. The conda environment is a Python3-based conda environment and has a minimal set of Python libraries installed. - Connect to Oracle Health Real-World Data from Notebooks
There are a variety of methods to connect to Oracle Health Real-World Data from notebooks. - FAQs for Data Science Service
Answers to frequently asked questions for Data Science Service in Oracle Life Sciences AI Data Platform.