
big-data

Mentors and Regional Facilitators
Name Region Skills Interests
Tony Elam Kentucky
Alana Romanella Campus Champions
Bala Desinghu CAREERS
Brian Gregor Northeast, Campus Champions
Balamurugan Desinghu ACCESS CSSN, Campus Champions, CAREERS, Northeast
Dylan Perkins RMACC, ACCESS CSSN
Fernando Garzon ACCESS CSSN
Feng George Yu Campus Champions
Jordan Hayes Campus Champions
Jacob Pessin Northeast
Katia Bulekova ACCESS CSSN, Campus Champions, CAREERS, Northeast
Thomas Langford Campus Champions, CAREERS
shuai liu ACCESS CSSN
Maryam Taeb
Neil McGlohon CAREERS
Jeffrey J. Nuc… CAREERS
Mahmoud Parvizi Campus Champions
Rob Harbert Northeast
Grant Scott Great Plains
Simon Delattre
Suhong Li CAREERS, ACCESS CSSN
Scott Valcourt Northeast, Campus Champions
Yun Shen CAREERS, Northeast, ACCESS CSSN
Yongwook Song Kentucky

Affinity Groups

Name Description Tags
Large Data Sets For people who evaluate or use storage options for researchers with large data sets. cloud-storage, big-data, data-transfer, open-storage-network, s3, ceph, hpc-storage
DARWIN DARWIN (Delaware Advanced Research Workforce and Innovation Network) is a big data and high performance computing system designed to catalyze Delaware research and education funded by a $1.4 million… darwin, big-data
Users
Name Roles Skills Interests
Abigail Waters student facilitator, student champion
Brian Gregor mentor, rcf
Balamurugan Desinghu mentor, rcf
Ethan Davis student facilitator
Jacob Pessin mentor
Katia Bulekova mentor, rcf
Kristi Burkholder researcher/educator
Northeast Cyberteam student facilitator
Rob Harbert mentor
safwan wshah researcher/educator
Scott Valcourt mentor, researcher/educator, rcf, steering committee
Alexander Williams researcher/educator
Yves Dubief researcher/educator
Yun Shen mentor

CI Links

Title Category Tags Skill Level
DARWIN Documentation Pages Docs darwin, big-data Beginner, Intermediate, Advanced
Introduction to Python for Digital Humanities and Computational Research Docs ai, big-data, data-analysis, deep-learning, data-science, python Beginner
Introductory Tutorial to Numpy and Pandas for Data Analysis Docs ai, big-data, data-analysis, vectorization Beginner
PyTorch for Deep Learning and Natural Language Processing Docs ai, big-data, data-analysis, deep-learning, machine-learning, neural-networks Beginner
Machine Learning in Astrophysics Docs plotting, big-data, image-processing, machine-learning, astrophysics Intermediate
Pandas - Python Docs documentation, ai, big-data, data-analysis Beginner, Intermediate
ACCESS HPC Workshop Series Learning big-data, deep-learning, machine-learning, neural-networks, tensorflow, gpu, technical-training-for-hpc, training, openmpi, c, c++, fortran, openmp, programming, mpi, spark Beginner, Intermediate
DeapSECURE – Data-Enabled Advanced Computational Training Platform for Cybersecurity Research and Education Learning ai, visualization, big-data, data-analysis, deep-learning, machine-learning, neural-networks, jekyll, batch-jobs, slurm, bash, ssh, technical-training-for-hpc, training, workforce-development, python, scikit-learn, cybersecurity Beginner
Research Software Development in JupyterLab: A Platform for Collaboration Between Scientists and RSEs Learning ai, visualization, big-data, data-analysis, deep-learning, machine-learning, astrophysics, data-science, novel-accelerators, computational-chemistry, genomics, materials-science, gravitational-waves, oceanography, particle-physics, physiology, psychology, quantum-computing, quantum-mechanics, biology, ondemand, science gateway, c++, jupyterhub, python, r Beginner, Intermediate
Awesome Jupyter Widgets (for building interactive scientific workflows or science gateway tools) Learning ai, computer-graphics, plotting, visualization, big-data, data-analysis, deep-learning, image-processing, machine-learning, monte-carlo, neural-networks, data-sharing, data-lifecycle, data-management, data-management-software, data-reproducibility, github, astrophysics, data-science, novel-accelerators, computational-chemistry, genomics, materials-science, gravitational-waves, oceanography, particle-physics, physiology, psychology, quantum-computing, quantum-mechanics, biology, ondemand, science gateway, c++, jupyterhub, python Beginner, Intermediate, Advanced
Machine Learning with sci-kit learn Learning ai, big-data, machine-learning Beginner
Numpy - a Python Library Tool documentation, big-data, data-analysis, deep-learning, opencv, pytorch, tensorflow, data-science Beginner, Intermediate
Scikit-Learn: Easy Machine Learning and Modeling Tool documentation, ai, plotting, visualization, big-data, data-analysis, deep-learning, image-processing, machine-learning, monte-carlo, neural-networks, vectorization Beginner, Intermediate
Beautiful Soup - Simple Python Web Scraping Tool documentation, ai, big-data, data-sharing, data-transfer, data-wrangling Beginner, Intermediate
Python Tools for Data Science Video ai, big-data, data-analysis, machine-learning, data-wrangling, data-science, technical-training-for-hpc, training, workforce-development, python, scikit-learn, sql Intermediate
Displaying Scientific Data with Tableau Video big-data, data-analysis, technical-training-for-hpc, training, workforce-development Intermediate
Expanse Home Page Website big-data Beginner, Intermediate, Advanced
Recommended Libraries for Cyberinfrastructure Users Developing Jupyter Notebooks Website ai, big-data, data-analysis, machine-learning, data-sharing, data-lifecycle, data-management, data-management-software, data-reproducibility, data-wrangling, github-pages, workflow, astrophysics, data-science, computational-chemistry, genomics, materials-science, gravitational-waves, oceanography, particle-physics, physiology, psychology, quantum-computing, quantum-mechanics, biology, science gateway, conda, jupyterhub, python Beginner, Intermediate, Advanced
Projects
Project Title Project Institution Project Owner Tags Status
LOBO Fleet Monitoring Darling Marine Center, University of Maine Northeast Cyberteam big-data, data-access-protocols, data-management, data-wrangling, metadata, file-formats, openstack, oceanography, python, software-installation, compiling, debugging Complete
Genome Sequencing of the Bornean Rock Frog Smith College Lisa Mangiamele big-data, bioinformatics, genomics In Progress
Using Genetic Algorithms and Support Vector Machines in Forest Mapping University of Maine Kasey Legaard big-data, compiling, data-management, machine-learning, matlab, neural-networks, openstack, parallelization, programming, workflow Complete
Incorporating Hytools into the current image processing pipeline to produce better vegetation maps that will account for radiometric signals and will parallelize workflow University of Maine at Fort Kent Larry Whitsel big-data, geographic-information-system, hpc-operations, image-processing, python, r Complete
Analyzing Pathogenic Clinical Isolates Genomes to Identify Horizontal Gene Transfer of Antibiotic-Resistance Genes University of Maine at Presque Isle Larry Whitsel big-data, bioinformatics, hpc-storage Complete
Student-led Development of Open Source Materials for Hadoop University of Maine Farmington Northeast Cyberteam big-data, ceph, data-wrangling, hadoop, storage Complete
Machine learning for material property prediction University of Maine Orono Northeast Cyberteam big-data, data-wrangling, computational-chemistry, molecular-dynamics, machine-learning, python, gpu Complete
Big Data Portal For Sharing Real-world Bioinformatics Data Sets to the Public Domain University of Maine, Augusta Bruce Segee big-data, bioinformatics, data-management, data-wrangling, hpc-storage, metadata, science gateway, storage Complete
Student-Developed HPC Cluster for Active Learning University of New Hampshire Scott Valcourt big-data, hardware, hpc-cluster-build, hpc-operations Complete
BTLE Beaconing to Track Objects University of New Hampshire Scott Valcourt big-data, programming-best-practices, programming, hardware Complete
Benchmarking Locally-Developed HPC Resources University of New Hampshire Scott Valcourt backup, big-data, data-management, file-systems, hpc-cluster-build, hpc-operations, permissions, provisioning, schedulers, slurm, unix-environment Complete
Deep Learning High-Resolution Land Cover Mapping for Vermont University of Vermont Jarlath O'Neil-Dunne arcgis, big-data, distributed-computing, geographic-information-system, image-processing, machine-learning, python Complete
Utility poles Geo-Localization and Risk Estimation using Deep Learning University of Vermont safwan wshah ai, arcgis, big-data, conda, cuda, deep-learning, geographic-information-system, gpu, machine-learning, pip, python, tensorflow, unix-environment Complete
Simulate and design “xenobots”, on the AMD platform University of Vermont Andrea Elledge administering-hpc, amber, big-data, biology, file-transfer, github, slurm Complete
Genetics and Big Data UVM Northeast Cyberteam big-data Complete