
big-data

Mentors and Regional Facilitators
Name Region Skills Interests
Tony Elam Kentucky
Bala Desinghu CAREERS
Brian Gregor Northeast, Campus Champions
Feng George Yu Campus Champions
Jordan Hayes Campus Champions
Jacob Pessin Northeast
Katia Bulekova Campus Champions, CAREERS, Northeast, ACCESS CSSN
Thomas Langford Campus Champions, CAREERS
shuai liu ACCESS CSSN
Maryam Taeb
Neil McGlohon CAREERS
Jeffrey J. Nuc… CAREERS
Mahmoud Parvizi Campus Champions
Rob Harbert Northeast
Grant Scott Great Plains
Simon Delattre
Suhong Li CAREERS, ACCESS CSSN
Scott Valcourt Northeast, Campus Champions
Yun Shen CAREERS, Northeast, ACCESS CSSN
Yongwook Song Kentucky

Affinity Groups

Name Description Tags
Large Data Sets For people who evaluate or use storage options for researchers with large data sets. cloud-storage, big-data, data-transfer, open-storage-network, s3, ceph, hpc-storage
DARWIN DARWIN (Delaware Advanced Research Workforce and Innovation Network) is a big data and high performance computing system designed to catalyze Delaware research and education funded by a $1.4 million… big-data
Users
Name Roles Skills Interests
Abigail Waters
student facilitator
student champion
Brian Gregor
mentor
rcf
Ethan Davis
student facilitator
Jacob Pessin
mentor
Katia Bulekova
mentor
rcf
Kristi Burkholder
researcher/educator
Northeast Cyberteam
student facilitator
Rob Harbert
mentor
safwan wshah
researcher/educator
Scott Valcourt
mentor
researcher/educator
rcf
steering committee
Alexander Williams
researcher/educator
Yves Dubief
researcher/educator
Yun Shen
mentor

CI Links

Title Category Tags Skill Level
DARWIN Documentation Pages Documentation big-data
Expanse Home Page Website big-data
ACCESS HPC Workshop Series Learning big-data, deep-learning, machine-learning, neural-networks, tensorflow, gpu, PROFESSIONAL and WORKFORCE DEVELOPMENT, technical-training-for-hpc, training, openmpi, c, c++, fortran, openmp, programming, mpi, spark Beginner, Intermediate
Python Tools for Data Science Video Link ai, big-data, data-analysis, machine-learning, data-wrangling, data-science, technical-training-for-hpc, training, workforce-development, python, scikit-learn, sql Intermediate
Displaying Scientific Data with Tableau Video Link big-data, data-analysis, technical-training-for-hpc, training, workforce-development Intermediate
Recommended Libraries for Cyberinfrastructure Users Developing Jupyter Notebooks Website ANALYSES and ALGORITHMS, ai, big-data, data-analysis, machine-learning, data-sharing, data-lifecycle, data-management, data-management-software, data-reproducibility, data-wrangling, github-pages, workflow, FIELD of SCIENCE, astrophysics, data-science, computational-chemistry, genomics, materials-science, gravitational-waves, oceanography, particle-physics, physiology, psychology, quantum-computing, quantum-mechanics, biology, GATEWAYS and PORTALS, science gateway, conda, jupyterhub, python Beginner, Intermediate, Advanced, Expert
Introduction to Python for Digital Humanities and Computational Research Documentation ai, big-data, data-analysis, deep-learning, data-science, python Beginner
Introductory Tutorial to Numpy and Pandas for Data Analysis Documentation ai, big-data, data-analysis, vectorization Beginner
PyTorch for Deep Learning and Natural Language Processing Documentation ai, big-data, data-analysis, deep-learning, machine-learning, neural-networks Beginner
Projects
Project Title Project Institution Project Owner Tags Status
Student-Developed HPC Cluster for Active Learning University of New Hampshire Scott Valcourt big-data, hardware, hpc-cluster-build, hpc-operations Complete
Student-led Development of Open Source Materials for Hadoop University of Maine Farmington Northeast Cyberteam big-data, ceph, data-wrangling, hadoop, storage Complete
Machine learning for material property prediction University of Maine Orono Northeast Cyberteam big-data, data-wrangling, computational-chemistry, molecular-dynamics, machine-learning, python, gpu Complete
BTLE Beaconing to Track Objects University of New Hampshire Scott Valcourt big-data, programming-best-practices, programming, hardware Complete
Genetics and Big Data UVM Northeast Cyberteam big-data Complete
Using Genetic Algorithms and Support Vector Machines in Forest Mapping University of Maine Kasey Legaard big-data, compiling, data-management, machine-learning, matlab, neural-networks, openstack, parallelization, programming, workflow Complete
LOBO Fleet Monitoring Darling Marine Center, University of Maine Northeast Cyberteam big-data, data-access-protocols, data-management, data-wrangling, metadata, file-formats, openstack, oceanography, python, software-installation, compiling, debugging Complete
Incorporating Hytools into the current image processing pipeline to produce better vegetation maps that will account for radiometric signals and will parallelize workflow University of Maine at Fort Kent Larry Whitsel big-data, geographic-information-system, hpc-operations, image-processing, python, r Complete
Analyzing Pathogenic Clinical Isolates Genomes to Identify Horizontal Gene Transfer of Antibiotic-Resistance Genes University of Maine at Presque Isle Larry Whitsel big-data, bioinformatics, hpc-storage Complete
Benchmarking Locally-Developed HPC Resources University of New Hampshire Scott Valcourt backup, big-data, data-management, file-systems, hpc-cluster-build, hpc-operations, permissions, provisioning, schedulers, slurm, unix-environment Complete
Deep Learning High-Resolution Land Cover Mapping for Vermont University of Vermont Jarlath O'Neil-Dunne arcgis, big-data, distributed-computing, geographic-information-system, image-processing, machine-learning, python Complete
Utility poles Geo-Localization and Risk Estimation using Deep Learning University of Vermont safwan wshah ai, arcgis, big-data, conda, cuda, deep-learning, geographic-information-system, gpu, machine-learning, pip, python, tensorflow, unix-environment Complete
Big Data Portal For Sharing Real-world Bioinformatics Data Sets to the Public Domain University of Maine, Augusta Bruce Segee big-data, bioinformatics, data-management, data-wrangling, hpc-storage, metadata, science gateway, storage Complete
Simulate and design “xenobots”, on the AMD platform University of Vermont Andrea Elledge administering-hpc, amber, big-data, biology, file-transfer, github, slurm Complete
Genome Sequencing of the Bornean Rock Frog Smith College Lisa Mangiamele big-data, bioinformatics, genomics In Progress