
Dr. Ayaz H. Khan's Projects

cppe-dataset

Code for our paper on CPPE-5 (Medical Personal Protective Equipment), a new, challenging object detection dataset

cs344

Introduction to Parallel Programming class code

gpuocelot

Automatically exported from code.google.com/p/gpuocelot

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

hydrazine

Automatically exported from code.google.com/p/hydrazine

image-recognition-tutorial-using-mxnet-with-docker

This extends the tutorial at https://www.r-bloggers.com/image-recognition-tutorial-in-r-using-deep-convolutional-neural-networks-mxnet-package/, an image recognition example using MXNet. Users can build a Docker image for the required environment and start running the example directly, without installing Linux or the other required packages.

jetbot

An educational AI robot based on NVIDIA Jetson Nano.

jetbot_ros

ROS nodes and Gazebo model for NVIDIA JetBot with Jetson Nano

morphologica

A library of supporting code for numerical modelling (JSON config, HDF5 data, Modern OpenGL visualization)

nemo

NeMo: a toolkit for conversational AI

padding_free_matrix_transpose_gpu

Advances in Graphics Processing Unit (GPU) technology and the introduction of the CUDA programming model facilitate the development of new solutions for sparse and dense linear algebra solvers. Matrix transpose is an important linear algebra procedure with a deep impact on many computational science and engineering applications. Several factors hinder the expected performance of large matrix transposes on GPU devices. The degradation comes from memory access patterns: whether global memory accesses are coalesced, and whether bank conflicts occur in the shared memory of the GPU's streaming multiprocessors. In this paper, two matrix transpose algorithms are proposed that address both issues by ensuring coalesced access and conflict-free bank access. The proposed algorithms have execution times comparable to the NVIDIA SDK bank-conflict-free matrix transpose implementation. Their main advantage is that they eliminate bank conflicts while allocating shared memory exactly equal to the tile size (T x T) of the problem space. By contrast, to the best of our knowledge, previously published approaches need to allocate a padded tile of T x (T+1). We have also applied the proposed transpose algorithm to the recursive Gaussian implementation in the NVIDIA SDK and achieved about a 6% improvement in performance.
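
The bank-conflict argument above can be checked with a small simulation. The sketch below is illustrative only: it assumes 32 shared-memory banks of one word each and a tile size of T = 32, and it uses a diagonal (skewed) layout as a stand-in for a padding-free mapping. The description does not give the paper's actual index mappings, so `skewed_index` is a hypothetical example, not the authors' algorithm.

```python
# Simulate which shared-memory banks one warp touches when reading a tile column.
# Assumptions (not from the paper): 32 banks, one word per bank slot, T = 32,
# and a diagonal layout as the illustrative padding-free mapping.
from collections import Counter

BANKS = 32
T = 32


def padded_index(row, col, pitch):
    """Linear word index for a row-major tile with the given row pitch."""
    return row * pitch + col


def skewed_index(row, col):
    """Hypothetical T x T layout: each row is rotated by its row number."""
    return row * T + (row + col) % T


def max_conflict(indices):
    """Largest number of threads in the warp that hit the same bank."""
    banks = Counter(i % BANKS for i in indices)
    return max(banks.values())


def column_read(index_fn, col):
    """Word indices touched when thread t reads element (t, col)."""
    return [index_fn(t, col) for t in range(T)]


# Plain T x T tile: every thread in the column maps to the same bank.
naive = max_conflict(column_read(lambda r, c: padded_index(r, c, T), 0))
# Padded T x (T+1) tile: the extra word per row spreads the column over all banks.
padded = max_conflict(column_read(lambda r, c: padded_index(r, c, T + 1), 0))
# Skewed T x T tile: conflict-free without the extra column.
skewed = max_conflict(column_read(skewed_index, 0))

print(naive, padded, skewed)
print("words used:", T * T, "(skewed) vs", T * (T + 1), "(padded)")
```

Under these assumptions, the padded and skewed layouts both give each thread its own bank, but the skewed layout stores only T x T words per tile instead of T x (T+1).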

rstudio

A repository that will be linked with RStudio
