Giter VIP home page Giter VIP logo

Comments (9)

MatthewARM avatar MatthewARM commented on August 15, 2024

Hi @liviolima80 we're seeing about 5 times faster than upstream Caffe across various tests on Cortex-A72 and Cortex-A54 using the NEON backend. I'm afraid we don't have benchmarking running on Cortex-A9, but realistically a smaller speedup would be expected.

from armnn.

liviolima80 avatar liviolima80 commented on August 15, 2024

Do you have public benchmark with the comparison to official Caffe library?

from armnn.

MatthewARM avatar MatthewARM commented on August 15, 2024

Hi @liviolima80 I have had a look and we have no public benchmark data at this time. We're working hard on the machine learning part of https://developer.arm.com/ and I will pass on your request to the relevant teams.

from armnn.

liviolima80 avatar liviolima80 commented on August 15, 2024

OK @MatthewARM ,
in the meantime, any option to run armnn in multi-thread mode in order to use multiple processors?

from armnn.

MatthewARM avatar MatthewARM commented on August 15, 2024

Hi @liviolima80 multi-threading is high up on our roadmap. In the meantime, you can try calling the methods of arm_compute::Scheduler yourself. The documentation for that is here: https://arm-software.github.io/ComputeLibrary/v18.03/classarm__compute_1_1_scheduler.xhtml

from armnn.

liviolima80 avatar liviolima80 commented on August 15, 2024

Hi @MatthewARM ,
I tried to take a look to the documentation and code but it is quite difficult to understand how to use. is it possible to have a clear and easy to use example on how to use the scheduler?
Regards

from armnn.

AnthonyBarbier avatar AnthonyBarbier commented on August 15, 2024

Please have a look at this question

from armnn.

liviolima80 avatar liviolima80 commented on August 15, 2024

Hi @AnthonyARM,
thank you for the link. I tried to change the number of threads in the scheduler setting but it seems that the better performance are obtained with the default config that is equal to 4 threads, that corresponds to the number of cores in the Cortex A9 architecture I'm using.
I'm looking forward to see if next releases of armnn will have some further improvement

from armnn.

praisonjoshua avatar praisonjoshua commented on August 15, 2024

I'm trying to compare the processing time running a convolutional network with Caffe C++ library and ArmnnCaffeParser built on ARM ComputeLibrary. These are details:

target architecture: Freescale IMX6Q

input image: 256x256 pixel and 3 channels

architecture:
conv2d 3x3 - 32 filters + RELU
conv2d 3x3 - 32 filters + RELU
max pool 2x2
conv2d 3x3 - 64 filters + RELU
conv2d 3x3 - 64 filters + RELU
max pool 2x2
conv2d 3x3 - 128 filters + RELU
max pool 2x2
conv2d 3x3 - 256 filters + RELU
max pool 2x2
conv2d 3x3 - 256 filters + RELU
max pool 2x2
fully connected layer with 48 output
relu
fully connected layer with 3 output
softmax

processing time:

  • Caffe c++ library (without ComputeLibrary optimization) : 4.5 seconds
  • ArmnnCaffeParser (with ComputeLibrary optimization): 2.2 seconds

I'm a little bit disappointed since my expectations were to boost the processing time more than a factor 2

Do you think that I'm doing something wrong or is this improvement factor comparable with your benchmarks?

Thanks

How did you calculate processing time ?
We are building a caffe2 parser for armnn we want to compare it with the default caffe2

from armnn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.