Difference between revisions of "Kaustubh Shivdikar"
m (New line added) (Tag: Visual edit) |
(Ph.D. thesis added) (Tag: Visual edit) |
||
(5 intermediate revisions by the same user not shown) | |||
Line 61: | Line 61: | ||
==Publications== | ==Publications== | ||
− | [[File:PIM Logo.png|alt=Scalability Limitations of Processing-in-Memory using Real System Evaluations|right|frameless| | + | [[File:neuracore.png|right|frameless|76x76px|NeuraCore Image]] |
+ | |||
+ | ======[https://wiki.kaustubh.us/w/img_auth.php/NeuraChip_GNN_Accelerator.pdf NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator]====== | ||
+ | {{Nutshell|Decoupled multiplication and addition operations to speedup GNNs.|title=NeuraChip}} | ||
+ | |||
+ | (ISCA 2024) [[https://wiki.kaustubh.us/w/img_auth.php/NeuraChip_GNN_Accelerator.pdf PDF]] [[https://www.researchgate.net/publication/380127919_NeuraChip_Accelerating_GNN_Computations_with_a_Hash-based_Decoupled_Spatial_Accelerator RG]] [Slides] [[https://github.com/NeuraChip/neurachip GitHub]] [[https://neurachip.us/ Website]] | ||
+ | |||
+ | ==.== | ||
+ | [[File:gpu icon.png|right|frameless|93x93px]] | ||
+ | |||
+ | ======[https://wiki.kaustubh.us/w/img_auth.php/Graph_Accelerator_Thesis.pdf Enabling Accelerators for Graph Computing]====== | ||
+ | {{Nutshell|Software and Hardware enhancements for GNNs|title=Thesis}} | ||
+ | |||
+ | (Ph.D. Thesis) [[https://wiki.kaustubh.us/w/img_auth.php/scalability_limitations.pdf PDF]] | ||
+ | ==.== | ||
+ | [[File:PIM Logo.png|alt=Scalability Limitations of Processing-in-Memory using Real System Evaluations|right|frameless|83x83px]] | ||
======[https://wiki.kaustubh.us/w/img_auth.php/scalability_limitations.pdf Scalability Limitations of Processing-in-Memory using Real System Evaluations]====== | ======[https://wiki.kaustubh.us/w/img_auth.php/scalability_limitations.pdf Scalability Limitations of Processing-in-Memory using Real System Evaluations]====== | ||
Line 78: | Line 93: | ||
|} | |} | ||
− | == . == | + | ==.== |
− | [[File: | + | [[File:network logo.png|alt=GNN Logo|right|frameless|82x82px|GNN Logo]] |
======[https://wiki.kaustubh.us/w/img_auth.php/maxk.pdf MaxK-GNN: Towards Theoretical Speed Limits for Accelerating Graph Neural Networks Training]====== | ======[https://wiki.kaustubh.us/w/img_auth.php/maxk.pdf MaxK-GNN: Towards Theoretical Speed Limits for Accelerating Graph Neural Networks Training]====== | ||
Line 95: | Line 110: | ||
|- | |- | ||
|'''Authors''': Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Shaoyi Huang, Omen Khan, Caiwen Ding, David Kaeli | |'''Authors''': Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Shaoyi Huang, Omen Khan, Caiwen Ding, David Kaeli | ||
− | |}[[File:AMD MI100 GPU.png|right|frameless| | + | |} |
+ | |||
+ | ==.== | ||
+ | [[File:AMD MI100 GPU.png|right|frameless|226x226px|AMD MI100 GPU]] | ||
======[https://wiki.kaustubh.us/w/img_auth.php/GME.pdf GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption]====== | ======[https://wiki.kaustubh.us/w/img_auth.php/GME.pdf GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption]====== | ||
Line 131: | Line 149: | ||
|- | |- | ||
|Authors: '''[[Kaustubh Shivdikar]]''', [https://scholar.google.com/citations?user=huCqT6IAAAAJ&hl=en&oi=ao Yuhui Bao], [https://www.linkedin.com/in/rashmi-agrawal-9a0601133/ Rashmi Agrawal], Michael Shen, [https://ieeexplore.ieee.org/author/37088654483 Gilbert Jonatan], [https://eveliomora.es/ Evelio Mora], Alexander Ingare, [https://neallivesay.github.io/ Neal Livesay], [https://sites.google.com/ucam.edu/jlabellan/ José L. Abellán], [http://icn.kaist.ac.kr/~jjk12/ John Kim], [https://www.bu.edu/eng/profile/ajay-joshi/ Ajay Joshi], [https://ece.northeastern.edu/fac-ece/kaeli.html David Kaeli] | |Authors: '''[[Kaustubh Shivdikar]]''', [https://scholar.google.com/citations?user=huCqT6IAAAAJ&hl=en&oi=ao Yuhui Bao], [https://www.linkedin.com/in/rashmi-agrawal-9a0601133/ Rashmi Agrawal], Michael Shen, [https://ieeexplore.ieee.org/author/37088654483 Gilbert Jonatan], [https://eveliomora.es/ Evelio Mora], Alexander Ingare, [https://neallivesay.github.io/ Neal Livesay], [https://sites.google.com/ucam.edu/jlabellan/ José L. Abellán], [http://icn.kaist.ac.kr/~jjk12/ John Kim], [https://www.bu.edu/eng/profile/ajay-joshi/ Ajay Joshi], [https://ece.northeastern.edu/fac-ece/kaeli.html David Kaeli] | ||
− | |}[[File:V100 GPU.png|right|frameless| | + | |} |
+ | |||
+ | ==.== | ||
+ | [[File:V100 GPU.png|right|frameless|168x168px]] | ||
======[https://wiki.kaustubh.us/w/img_auth.php/FHE_IEEE_Micro.pdf Accelerating Finite Field Arithmetic for Homomorphic Encryption on GPUs]====== | ======[https://wiki.kaustubh.us/w/img_auth.php/FHE_IEEE_Micro.pdf Accelerating Finite Field Arithmetic for Homomorphic Encryption on GPUs]====== | ||
{{Nutshell|Optimized modulo "%" operator on NVIDIA V100 GPU to speedup Homomorphic Encryption.|title=Paper}} | {{Nutshell|Optimized modulo "%" operator on NVIDIA V100 GPU to speedup Homomorphic Encryption.|title=Paper}} | ||
− | ([https://ieeexplore.ieee.org/document/10068510 IEEE MICRO 2023]) [[https://wiki.kaustubh.us/w/img_auth.php/FHE_IEEE_Micro.pdf PDF]] [[https://www.researchgate.net/publication/369197973_Accelerating_Finite_Field_Arithmetic_for_Homomorphic_Encryption_on_GPUs RG]][[File:Lady Bug.png|right|frameless| | + | ([https://ieeexplore.ieee.org/document/10068510 IEEE MICRO 2023]) [[https://wiki.kaustubh.us/w/img_auth.php/FHE_IEEE_Micro.pdf PDF]] [[https://www.researchgate.net/publication/369197973_Accelerating_Finite_Field_Arithmetic_for_Homomorphic_Encryption_on_GPUs RG]] |
+ | ==.== | ||
+ | [[File:Lady Bug.png|right|frameless|111x111px]] | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/FHE_SEED_2022.pdf Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/FHE_SEED_2022.pdf Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs]'''====== | ||
Line 161: | Line 184: | ||
|} | |} | ||
− | + | ==.== | |
− | + | [[File:Hacker icon.png|right|frameless|90x90px]] | |
− | |||
− | [[File:Hacker icon.png|right|frameless| | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/JAXED_Reverse_Engineering_DNN_Architectures_Leveraging_JIT_GEMM_Libraries.pdf JAXED: Reverse Engineering DNN Architectures Leveraging JIT GEMM Libraries]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/JAXED_Reverse_Engineering_DNN_Architectures_Leveraging_JIT_GEMM_Libraries.pdf JAXED: Reverse Engineering DNN Architectures Leveraging JIT GEMM Libraries]'''====== | ||
Line 187: | Line 208: | ||
|} | |} | ||
− | + | ==.== | |
− | + | [[File:Mini GNN.png|right|frameless|98x98px]] | |
− | |||
− | [[File:Mini GNN.png|right|frameless| | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/GNNMark.pdf GNNMark: A benchmark suite to characterize graph neural network training on GPUs]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/GNNMark.pdf GNNMark: A benchmark suite to characterize graph neural network training on GPUs]'''====== | ||
Line 213: | Line 232: | ||
|} | |} | ||
− | + | ==.== | |
− | + | [[File:Core Image SMASH.png|right|frameless|45x45px]] | |
− | |||
− | [[File:Core Image SMASH.png|right|frameless| | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/SMASH_Thesis.pdf SMASH: Sparse Matrix Atomic Scratchpad Hashing]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/SMASH_Thesis.pdf SMASH: Sparse Matrix Atomic Scratchpad Hashing]'''====== | ||
Line 241: | Line 258: | ||
|} | |} | ||
+ | ==.== | ||
− | |||
− | |||
======'''[https://wiki.kaustubh.us/w/img_auth.php/SC_18_Cluster_Contest_Paper.pdf Student cluster competition 2018, team northeastern university: Reproducing performance of a multi-physics simulations of the Tsunamigenic 2004 Sumatra Megathrust earthquake on the AMD EPYC 7551 architecture]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/SC_18_Cluster_Contest_Paper.pdf Student cluster competition 2018, team northeastern university: Reproducing performance of a multi-physics simulations of the Tsunamigenic 2004 Sumatra Megathrust earthquake on the AMD EPYC 7551 architecture]'''====== | ||
{{Nutshell|Optimizing earthquake simulation workload on AMD CPUs.|title=Paper}} | {{Nutshell|Optimizing earthquake simulation workload on AMD CPUs.|title=Paper}} | ||
Line 261: | Line 277: | ||
|} | |} | ||
+ | ==.== | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/BARC_speeding.pdf Speeding up DNNs using HPL based Fine-grained Tiling for Distributed Multi-GPU Training]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/BARC_speeding.pdf Speeding up DNNs using HPL based Fine-grained Tiling for Distributed Multi-GPU Training]'''====== | ||
Line 267: | Line 284: | ||
([https://bostonarch.github.io/2018/ BARC 2018]) [[https://wiki.kaustubh.us/w/img_auth.php/BARC_speeding.pdf PDF]] | ([https://bostonarch.github.io/2018/ BARC 2018]) [[https://wiki.kaustubh.us/w/img_auth.php/BARC_speeding.pdf PDF]] | ||
[[https://www.researchgate.net/publication/357766887_Speeding_up_DNNs_using_HPL_based_Fine-grained_Tiling_for_Distributed_Multi-GPU_Training RG]] | [[https://www.researchgate.net/publication/357766887_Speeding_up_DNNs_using_HPL_based_Fine-grained_Tiling_for_Distributed_Multi-GPU_Training RG]] | ||
+ | |||
+ | ==.== | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/Video_Steganography.pdf Video steganography using encrypted payload for satellite communication]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/Video_Steganography.pdf Video steganography using encrypted payload for satellite communication]'''====== | ||
Line 273: | Line 292: | ||
([https://2017.aeroconf.org/ Aerospace Conference 2017]) [[https://wiki.kaustubh.us/w/img_auth.php/Video_Steganography.pdf PDF]] | ([https://2017.aeroconf.org/ Aerospace Conference 2017]) [[https://wiki.kaustubh.us/w/img_auth.php/Video_Steganography.pdf PDF]] | ||
[[https://www.researchgate.net/publication/317702110_Video_steganography_using_encrypted_payload_for_satellite_communication RG]] | [[https://www.researchgate.net/publication/317702110_Video_steganography_using_encrypted_payload_for_satellite_communication RG]] | ||
+ | |||
+ | ==.== | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/missing_middle.pdf Missing 'Middle Scenarios' Uncovering Nuanced Conditions in Latin America's Housing Crisis]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/missing_middle.pdf Missing 'Middle Scenarios' Uncovering Nuanced Conditions in Latin America's Housing Crisis]'''====== | ||
Line 280: | Line 301: | ||
[[https://www.researchgate.net/publication/361864952_Missing_Middle_Scenarios_Uncovering_Nuanced_Conditions_in_Latin_America's_Housing_Crisis RG]] | [[https://www.researchgate.net/publication/361864952_Missing_Middle_Scenarios_Uncovering_Nuanced_Conditions_in_Latin_America's_Housing_Crisis RG]] | ||
+ | ==.== | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/dynamic_power.pdf Dynamic power allocation using Stackelberg game in a wireless sensor network]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/dynamic_power.pdf Dynamic power allocation using Stackelberg game in a wireless sensor network]'''====== | ||
Line 286: | Line 308: | ||
([https://2016.aeroconf.org/ Aerospace Conference 2016]) [[https://wiki.kaustubh.us/w/img_auth.php/dynamic_power.pdf PDF]] | ([https://2016.aeroconf.org/ Aerospace Conference 2016]) [[https://wiki.kaustubh.us/w/img_auth.php/dynamic_power.pdf PDF]] | ||
[[https://www.researchgate.net/publication/294873909_Dynamic_Power_Allocation_using_Stackelberg_Game_in_a_Wireless_Sensor_Network RG]] | [[https://www.researchgate.net/publication/294873909_Dynamic_Power_Allocation_using_Stackelberg_Game_in_a_Wireless_Sensor_Network RG]] | ||
+ | |||
+ | ==.== | ||
======'''[https://wiki.kaustubh.us/w/img_auth.php/automatic_image.pdf Automatic image annotation using a hybrid engine]'''====== | ======'''[https://wiki.kaustubh.us/w/img_auth.php/automatic_image.pdf Automatic image annotation using a hybrid engine]'''====== | ||
Line 293: | Line 317: | ||
[[https://www.researchgate.net/publication/294874083_Automatic_Image_Annotation_using_a_Hybrid_Engine RG]] | [[https://www.researchgate.net/publication/294874083_Automatic_Image_Annotation_using_a_Hybrid_Engine RG]] | ||
− | |||
− | |||
<br /> | <br /> | ||
<br /> | <br /> | ||
Line 300: | Line 322: | ||
<br /> | <br /> | ||
<br /><br /> | <br /><br /> | ||
− | |||
− | |||
− | |||
− | |||
Latest revision as of 08:19, 30 April 2024
Hi, I am Kaustubh, a Ph.D. candidate studying computer engineering in NUCAR lab at Northeastern University with my advisor David Kaeli. My research focuses on designing hardware accelerators for sparse graph workloads.
My expertise lies in:
- Computer Architecture Simulator Design
- Graph Neural Network Accelerators
- Sparse Matrix Accelerators
- Homomorphic Encryption Accelerators
- GPU Kernel Design
Contact:
- shivdikar.k [at] northeastern [dot] edu
- mail [at] kaustubh [dot] us
[Resume]
Education
- PhD - Computer Engineering, Northeastern University [May 2024]
- MS - Electrical and Computer Engineering, Northeastern University [May 2021]
- BS - Electrical Engineering, Veermata Jijabai Technological Institute [May 2016]
Work
- Summer-Fall 2020 Coop: Parallel Computing Lab @ Intel Labs with Fabrizio Petrini.
- Summer-Fall 2019 Coop: Parallel Computing Lab @ Intel Labs with Fabrizio Petrini.
- Summer-Fall 2018 Coop: Mobile Robotics @ Omron Adept with George Paul.
Recent News
- June 2022: Mentored Lina Adkins for the GNN Acceleration project at REU-Pathways program
- May 2022: Served as Submission co-chair for HPCA 2023 conference.
- Jan 2020: Taught the GPU Programming Course at NEU
- April 2019: Graduate Innovator Award at the RISE 2019 Research Expo for our poster Pi-Tiles
- April 2018: Best Poster Award at the RISE 2018 Research Expo for our poster The Prime Hexagon
- Nov 2018: Mentored the NEU team for Student Cluster Contest at Super Computing Conference 2018
- Nov 2017: Joined the NEU Team for Student Cluster Contest at Super Computing Conference 2017
Publications
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator
NeuraChip in a nutshell: Decoupled multiplication and addition operations to speedup GNNs. |
(ISCA 2024) [PDF] [RG] [Slides] [GitHub] [Website]
.
Enabling Accelerators for Graph Computing
Thesis in a nutshell: Software and Hardware enhancements for GNNs |
(Ph.D. Thesis) [PDF]
.
Scalability Limitations of Processing-in-Memory using Real System Evaluations
Paper in a nutshell: Suggest interconnects for PIM nodes to enable efficient all-to-all communication. |
(ACM SIGMETRICS 2024) [PDF] [RG]
.
MaxK-GNN: Towards Theoretical Speed Limits for Accelerating Graph Neural Networks Training
Paper in a nutshell: Presented MaxK non-linearity as a universal approximator to speedup GNN training |
(ASPLOS 2024) [PDF] [GitHub] [RG]
.
GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption
GME in a nutshell: Enhanced the AMD MI-100 GPU on-chip network to accelerate Homomorphic Encryption. |
(IEEE/ACM MICRO 2023) [PDF] [RG] [Slides]
.
Accelerating Finite Field Arithmetic for Homomorphic Encryption on GPUs
Paper in a nutshell: Optimized modulo "%" operator on NVIDIA V100 GPU to speedup Homomorphic Encryption. |
(IEEE MICRO 2023) [PDF] [RG]
.
Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs
Paper in a nutshell: Incorporated NVIDIA V100's shared memory to accelerate Homomorphic Encryption. |
(SEED 2022) [PDF][Slides] [RG]
.
JAXED: Reverse Engineering DNN Architectures Leveraging JIT GEMM Libraries
JAXED in a nutshell: Exposed software cache's vulnerability with a side-channel attack. |
(SEED 2021) [PDF] [Slides] [Poster] [RG]
.
GNNMark: A benchmark suite to characterize graph neural network training on GPUs
GNNMark in a nutshell: Created a standardized suite of GNN workloads to benchmark GPUs. |
(ISPASS 2021) [PDF] [RG]
.
SMASH: Sparse Matrix Atomic Scratchpad Hashing
SMASH in a nutshell: Optimized SpGEMM for Intel's PIUMA architecture. |
(MS Thesis, 2021) [PDF] [RG]
.
Student cluster competition 2018, team northeastern university: Reproducing performance of a multi-physics simulations of the Tsunamigenic 2004 Sumatra Megathrust earthquake on the AMD EPYC 7551 architecture
Paper in a nutshell: Optimizing earthquake simulation workload on AMD CPUs. |
.
Speeding up DNNs using HPL based Fine-grained Tiling for Distributed Multi-GPU Training
Paper in a nutshell: Evaluating the cluster equipped with NVIDIA GPUs using the Linpack benchmark. |
.
Video steganography using encrypted payload for satellite communication
Paper in a nutshell: Concealing secret messages in videos. |
(Aerospace Conference 2017) [PDF] [RG]
.
Missing 'Middle Scenarios' Uncovering Nuanced Conditions in Latin America's Housing Crisis
Paper in a nutshell: Integrating LSTM models with Latin American housing data to devise solutions. |
(Cityscape 2017) [PDF] [RG]
.
Dynamic power allocation using Stackelberg game in a wireless sensor network
Paper in a nutshell: Employing game theory for power distribution models. |
(Aerospace Conference 2016) [PDF] [RG]
.
Automatic image annotation using a hybrid engine
Paper in a nutshell: A hybrid engine that merges feature extraction with language models. |
(Indicon 2015) [PDF] [RG]
Posters
- FHE [PDF]
- JAXED [PDF]
- Pi-Tiles (Graduate Innovator Award) [PDF]
- The Prime Hexagon (Best Poster Award) [PDF]
What is KTB Wiki?
KTB Wiki, because the best way to store your knowledge is in an indexed SQL database.
This website was built on KTB Wiki. KTB wiki is my side project/attempt to consolidate knowledge gained during my Ph.D. journey. Though many other platforms provide similar service, the process of creating KTB Wiki was a learning experience since it taught me concepts of indexing, load balancing, and in-memory file systems. KTB Wiki was built using MediaWiki and is intended for research purposes only.
Interesting Reads