microsoft / aiskillsforwindows

Contains samples for implementing Windows Skills by extending the preview base API and using existing skill packages

Home Page: https://docs.microsoft.com/en-us/windows/ai/windows-vision-skills/

License: MIT License

aiskillsforwindows's Introduction

This code repo is now deprecated and the project is unsupported

AI Skills for Windows

Summary

Implementing and integrating efficient AI and Computer Vision (CV) solutions is a hard task for developers. The industry is moving at a fast pace, and the number of custom-tailored solutions coming out makes it almost impossible for app developers to keep up. We preview a base framework for exposing AI solutions, called AI Skills for Windows, as well as pre-built common solutions (e.g. detection, classification, segmentation) developed by Microsoft and partners.

The AI Skills for Windows framework is meant to standardize the way AI and CV are put to use within a Windows application (e.g. UWP, Desktop Win32, .NET Core 3.0+) running on the edge. It aims to abstract away the complexity of AI techniques by defining the concept of skills: modular pieces of code that process input(s) and produce output(s). The implementation that contains the complex details (e.g. pre- and post-processing of data, model inference, algorithms, transcoding, applying the right heuristics) is encapsulated by WinRT APIs that inherit the base class present in the Microsoft.AI.Skills.SkillInterface namespace. That namespace leverages built-in Windows primitives, which in turn eases interop with the built-in hardware acceleration leveraged by frameworks such as Windows ML. All AI Skills for Windows derivatives follow the same programmatic paradigm and flow from a consuming developer's standpoint: if you understand how to use one AI Skill for Windows, you understand how to use them all. (See key AI Skills API concepts)
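The "use one, use them all" idea can be illustrated with a small language-neutral sketch. This is not the WinRT API: the class and function names below (`SkillDescriptor`, `Skill`, `SkillBinding`, `run_skill`) are hypothetical stand-ins written in Python, and the two toy skills are invented for illustration. It only shows how a shared describe, create, bind, evaluate flow lets unrelated skills be consumed in the same way.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class SkillBinding:
    """Holds the named input and output values for one evaluation (hypothetical)."""
    values: Dict[str, Any] = field(default_factory=dict)


@dataclass
class Skill:
    """A modular unit that reads inputs from a binding and writes its output back."""
    input_names: List[str]
    output_name: str
    logic: Callable[..., Any]

    def create_binding(self) -> SkillBinding:
        return SkillBinding()

    def evaluate(self, binding: SkillBinding) -> None:
        # Refuse to run if any declared input has not been bound.
        missing = [n for n in self.input_names if n not in binding.values]
        if missing:
            raise ValueError(f"unbound inputs: {missing}")
        args = [binding.values[n] for n in self.input_names]
        binding.values[self.output_name] = self.logic(*args)


@dataclass
class SkillDescriptor:
    """Describes a skill's inputs and output and instantiates it (hypothetical)."""
    name: str
    input_names: List[str]
    output_name: str
    logic: Callable[..., Any]

    def create_skill(self) -> Skill:
        return Skill(self.input_names, self.output_name, self.logic)


def run_skill(descriptor: SkillDescriptor, **inputs: Any) -> Any:
    """The same four steps for every skill: create, bind, evaluate, read output."""
    skill = descriptor.create_skill()
    binding = skill.create_binding()
    binding.values.update(inputs)
    skill.evaluate(binding)
    return binding.values[descriptor.output_name]


# Two unrelated toy skills consumed through the identical flow.
brightness = SkillDescriptor(
    "MeanBrightness", ["pixels"], "mean", lambda pixels: sum(pixels) / len(pixels)
)
word_count = SkillDescriptor(
    "WordCount", ["text"], "count", lambda text: len(text.split())
)
```

Calling `run_skill(brightness, pixels=[0, 100, 200])` and `run_skill(word_count, text="AI Skills for Windows")` goes through exactly the same steps even though the skills have nothing else in common.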

While this release focuses on vision-oriented scenarios and primitives, this API is meant to accommodate any kind of input and output variable and a wide range of scenarios (Vision, Audio, Text, etc.). Any developer can extend this API set and expose their own AI skills. See skills released by Intel

If you are looking for the earlier preview release samples and documentation, we archived them in a branch here: Preview branch

Code samples for using AI skills for Windows published by Microsoft on nuget.org

Object Detector

	Detecting and classifying objects in images

Object Tracker

	Tracking objects in videos

Skeletal Detector

	Estimating poses of people in images

Concept Tagger

	Obtaining classification scores of concepts in images

Image Scanning

	A set of AI skills to achieve content scanning scenarios such as the ones featured in OfficeLens
CurvedEdgesDetector	Seeks within an image the pixels that constitute the curved edges composing the contour of a given quad and returns their coordinates.
ImageCleaner	Cleans and enhances an image given a specified preset.
ImageRectifier	Rectifies and crops an image to a rectangle plane given four UV coordinates.
LiveQuadDetector and QuadDetector	Search an image for quadrilateral shapes and return the coordinates of their corners if found. LiveQuadDetector is a stateful version of QuadDetector that attempts to detect only one quadrangle and keeps track of the previously detected quad to use as a guide, which optimizes tracking performance as new frames are bound over time. This is well suited for most scenarios that operate over a stream of frames. QuadDetector can be set to detect more than one quadrangle and searches the whole frame every time unless a previous quadrangle is provided.
QuadEdgesDetector	Searches an image for the horizontal and vertical lines defining a quadrilateral shape's contour and returns their coordinates.

For samples using AI skills published by Intel on nuget.org see the Intel-AI GitHub and this link for further details

AI Skill Name	Description
Background Blur	Segments out individuals while blurring the background image to highlight the individuals in the foreground.
Background Replacement	Segments out individuals while replacing the background with a user-selected image.
Face Detection	Detects face(s) and returns face bounding box(es) and other attributes, such as eyes, mouths, or nose tips.
Intruder Detection	Detects an intruder by checking whether an additional face or person is present in the video frame.
Person Detection	Detects person(s) and returns person bounding box(es).
Super Resolution	Converts a low-resolution image or video frame (320 x 240) to a high-resolution image (1280 x 960).
Super Resolution (WinML)	Converts a low-resolution image or video frame (640 x 360) to a high-resolution image (1280 x 720).

Copyright (c) Microsoft Corporation. All rights reserved.

aiskillsforwindows's Issues

Output ObjectDetectorPreview Confidence Level with Matches

Request for ObjectDetectorPreview :
Is it possible to output the confidence level with each match, like what is returned for Windows ML / Custom Vision ONNX files? The evaluations return poor results in some cases (e.g. mistaking leaves on a tree branch for a 'Person' in footage from a 4K security camera is one ongoing issue). I'm pretty sure the ability to filter out low-confidence matches would make a big difference here.
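The filtering this request has in mind could look like the sketch below. It assumes a hypothetical `Detection` record with a `confidence` field in the range 0 to 1; according to this issue, ObjectDetectorPreview does not expose such a value, so the names and the 0.5 default threshold are illustrative only.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Detection:
    label: str
    confidence: float  # hypothetical field, assumed to lie in [0.0, 1.0]


def filter_by_confidence(
    detections: List[Detection], threshold: float = 0.5
) -> List[Detection]:
    """Keep only the detections whose confidence meets the threshold."""
    return [d for d in detections if d.confidence >= threshold]


# Invented sample values: a confident match and a weak false positive.
frame_results = [
    Detection("Person", 0.91),
    Detection("Person", 0.12),  # e.g. leaves on a branch mistaken for a person
    Detection("Car", 0.67),
]
kept = filter_by_confidence(frame_results, threshold=0.5)
```

With a threshold of 0.5, `kept` retains the 0.91 and 0.67 matches and drops the 0.12 one.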

Win32 Consume Windows Skill without Application Manifest

I'm writing a Win32 DLL that uses a Windows Vision Skill to process an input image. The DLL is loaded automatically by a system process, and I don't have permission to edit the application manifest.
I get the exception "ClassFactory cannot supply requested class".

Is there any way to load a Windows Vision Skill inside a Win32 DLL without using an application manifest?

Vision Skill initialize failed in Xbox UWP

I tested on Xbox, but when I call the async create-skill method (as in your skeletal UWP sample), it crashes with an access violation exception.

SkeletalDetector and hand recognition

Hello,

Is hand recognition not implemented?
Fabio

Exception thrown when running sample "SkeletalDetectorSample_Desktop"

Hi, I'm trying to run the vpu_preview sample "SkeletalDetectorSample_Desktop". My OS version is 19569, so I updated it in the project properties and also updated the NuGet package https://www.nuget.org/packages/Microsoft.AI.Skills.SkillInterfacePreview/# to the latest one.

It builds successfully, but an exception is thrown when I run it. Do I need additional configuration to get it to run? Thanks.

Exception thrown at 0x00007FFF88F9A599 (KernelBase.dll) in SkeletalDetectorSample_Desktop.exe: WinRT originate error - 0x80040154 : 'Class not registered'.
'SkeletalDetectorSample_Desktop.exe' (Win32): Loaded 'C:\Windows\System32\bcrypt.dll'.
'SkeletalDetectorSample_Desktop.exe' (Win32): Loaded 'C:\Windows\System32\sechost.dll'.
Exception thrown at 0x00007FFF88F9A599 in SkeletalDetectorSample_Desktop.exe: Microsoft C++ exception: winrt::hresult_error at memory location 0x000000CDA515E970.

Mobile Version of these skills

It would be nice if you could provide NuGet packages that are compatible with .NET Standard 2.0, iOS and Android along with UWP. That way we could build AI Vision apps for the mobile platforms. After all, mobile is nothing without cloud support, and cloud needs a mobile frontend in addition to desktop, web, etc.

Could not find Windows Runtime type 'Microsoft.AI.Skills.SkillInterfacePreview.ISkill'

I loaded this library from NuGet.

SkeletalDetectorSample_UWP cannot find any bodies in GPU mode

The SkeletalDetectorSample_UWP cannot find any bodies in GPU mode, but it runs correctly in CPU mode. The ObjectDetectorSample_UWP runs correctly in both.

Environment:
Lenovo Y7000
Windows 10 Insider Preview Build 18965.rs_prerelease.190803
CPU: i7-9750H
RAM: 16GB
GPU: Nvidia RTX 2060
IDE: Visual Studio 2019 Community

Missing package Contoso.FaceSentimentAnalyzer_CS

I searched several nuget sources, I did find three Windows.AI.Vision,xxx packages, but not the FaceSentimentAnalyzer. Is this sample still in development?

I see there is a similar sample, Emoji8; I'll give that a try. This is for the Hackathon, so it's not super critical … but it would be cool :)

https://github.com/Microsoft/Windows-Machine-Learning/tree/master/Samples/Emoji8/UWP/cs

Thanks, Conzog

Class not registered error on desktop sample

Hi!
I'm kind of new to C++/WinRT and trying to test desktop samples with VS2017.
I started from VisionSkillsSamples.sln, then built and ran Desktop/ObjectDetectorSample_Desktop.

While dealing with this issue, I found that my executable doesn't have dependencies on skill-related DLLs (e.g. Microsoft.AI.Skills.SkillInterfacePreview.dll ...). I'm not sure whether this is related to my issue, but I expected to have those dependencies.

Please let me know if I missed any configuration required.
Thank you!

I had to run unblock-file on prebuild and postbuild scripts to make them work.

1.0.0.2 Intel Vision Skills NuGet Packages Broken / Attribution Text Files marked as 'Compile'

FYI:
There's a problem with the 1.0.0.2 'Intel' Vision Skills packages that were published by Intel (I have not yet checked whether the 1.0.0.3 updates suffer from the same problem).

All the embedded attribution 'text' files appear to have been included with the 'Compile' build action, which prevents any samples from being built once you download the packages from NuGet. (VS2019 throws a bunch of compile errors as it tries to treat all these text files as code.)

To fix this I had to go into the NuGet package cache folder and manually delete all the .txt files from those subfolders before I could compile anything.

As per above, I just saw that 1.0.0.3 has been published (with new skills). I will check whether it's still a problem there (I can't find any release notes for your stuff, so I'm unable to see what was fixed).

thanks
Niall

This repo is missing important files

There are important files that Microsoft projects should all have that are not present in this repository. A pull request has been opened to add the missing file(s). When the pr is merged this issue will be closed automatically.

Microsoft teams can learn more about this effort and share feedback within the open source guidance available internally.

Merge this pull request

Do we have a sample that works with Visual Studio Code?

Usage for Hololens 2

Respected People,

I am trying to develop an object detection application for HoloLens 2 via Unity. I just want to know whether the namespaces and the code snippets used in this sample for object detection can also be used inside the Unity editor and deployed to HoloLens 2. Is that possible? If not, are there any other sample references that could be used for object detection on HoloLens 2? Please do help me with this.

Help with pattern recognition

Hello,
I want to study the use of AI in the context of pattern recognition. Not that of human faces that has been done for a long time, but in that of the recognition of mushroom spores. Visually we can compare these spores to seeds of various plants.
I am just starting out in this area, but I think the first step is building models of these objects.
Can Windows Vision Skills help with this project?

Thank you

SkeletalDetector very slow on Hololens 2

On my local machine (Win 10, Win 18362.1082) the skeletaldetector example runs very well with ~200ms eval time on the CPU (i7-6700k) and less than 100ms on the GPU (1080Ti).
When trying it out on the Hololens 2 (10.0.19041.1382), the framerate drops drastically with eval time of ~2000ms on the CPU (Armv8 64-Bit Family 8 Model 803 Revision 70C) and ~10000ms on the GPU (Qualcomm Adreno 630GPU).
Any way to make this faster on the HL2?

Crashing after deploying on Windows IoT

We have a UWP application.

After adding these libraries from NuGet, we have a problem.

When we run our application on desktop (x86/x64), everything works.

But when we run it on a Raspberry Pi 3 (Windows IoT, ARM), we get an exception when we try to initialize the class ObjectDetectorDescriptor:

HRESULT: 0x80040154 (REGDB_E_CLASSNOTREG)

for example:
var descriptor = new ObjectDetectorDescriptor();
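
For reference, the error code in this report can be split into its standard HRESULT fields. The sketch below is not part of the original report: `decode_hresult` is a hypothetical helper name, and the bit layout it assumes (failure flag in bit 31, 11-bit facility in bits 16 to 26, 16-bit code in bits 0 to 15) follows the documented HRESULT structure.

```python
def decode_hresult(hresult: int) -> dict:
    """Split a 32-bit HRESULT into its failure flag, facility and code."""
    value = hresult & 0xFFFFFFFF  # also accepts the signed form used by .NET
    return {
        "failed": bool(value >> 31),         # bit 31: severity (1 = failure)
        "facility": (value >> 16) & 0x7FF,   # bits 16-26: facility
        "code": value & 0xFFFF,              # bits 0-15: error code
    }


fields = decode_hresult(0x80040154)
```

Here `fields` reports a failure in facility 4 (FACILITY_ITF, interface-specific errors) with code 0x154, which together form REGDB_E_CLASSNOTREG, the same "Class not registered" error named in the report.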

Sometimes could not get a result

I use the AI Skill ConceptTagger in a C++/WinRT project to analyze images.
It works great most of the time.
However, sometimes I cannot get any result for a picture that previously gave a result successfully,
and I don't get any error message.
All I know is that Microsoft.AI.Skills.SkillInterface.dll doesn't work.

Data Error (Cyclic Redundancy Check)

Hi, I ran the ObjectDetectorSample project and got the error "Data Error (Cyclic Redundancy Check)".
It occurs at "await InitializeObjectDetectorAsync();".

How can I fix this?
thanks!