UC San Diego Datahub & Research Platform
On November 17th, we detected mass networking errors to DataHub. We have since reset the Hub's database which has appeared to fix the issue. Users who were running pods during the outage will have to wait for their pods to self-terminate (~1 hour after 2:30) before they can launch another one. We apologize for any inconvenience the outage may have caused. -DSMLP Staff
- Request Datahub/DSMLP - Instructional Technology Request (CINFO)
- Scope of Support & Guidelines for Usage
- Instructor Guidance for Datahub/DSMLP
- Educational Technology Services Instructional Github
- Blink Documentation
- Datahub Grading Tools
Web-based Jupyter notebooks allow users to combine live code, equations, visualizations and narrative text for data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and more. DSMLP's Jupyter notebooks offer straightforward interactive access to popular languages and GPU-enabled frameworks such as Python, R, Pandas, PyTorch, TensorFlow, Keras, NLTK, and AllenNLP.
Complex ML workflows are supported through terminal/SSH logins, background batch jobs, and a full Linux/Ubuntu CUDA development suite. Users may install additional library packages (e.g. conda/pip, CRAN) as needed, or can opt to replace the default environment entirely by launching their own custom Docker containers.
High-speed cluster-local storage houses student workspaces, course files, and common training corpora (e.g. CIFAR, ImageNet).
If you would like to acknowledge this service in any publications, please use the text below.The Data Science and Machine Learning Platform is operated by IT Services (ITS), with additional financial contributions from Cognitive Science and Jacobs School of Engineering