I tried everything.
Your tutorial works fine with the 'yolov8n.onnx' model I've just exported.
But if I use my own 'yolov8_custom.onnx' model (trained from the large model), it doesn't detect anything.
Could you help me?
Could you share the model and a sample image or video for testing?
I was using webcam input (it worked with the yolov8n model).
Here is the model:
we.tl/t-NMFDafC8Pc
Also, here is the project folder if you want to try it:
we.tl/t-n2lGv4o0oW
I wasn't able to upload it to GitHub, the file is too large :/
These links do not work in my location.
Can you try Google Drive? It works fine here.
Okay, one moment.
Here is the link:
drive.google.com/drive/folders/1FQ...
Cool, I just ran your code and it worked. (However, I do not know American Sign Language, so maybe the model predicts incorrect labels.)
I think the model being loaded is yolov8n, not yolov8_custom.
Maybe you need to modify the line that loads the model.
Yeah, I verified it, it is loading the yolov8n model.
Comment that line and uncomment the other line that loads my model (yolov8_custom.onnx).
Yes, I changed it to your custom model. It worked much more slowly, because it's large, but it finally predicted something.
So, you didn't change any code, right?
It works as intended?
Yes, it works. I did not change any other code, except the model file name.
But it's too slow for real-time detection in videos on an average user's CPU. I think it's better to train it based on the smaller Nano or Small models.
I will try to train on the YOLOv8 Nano model.
Any other tips for my training that you can give me?
Like, how many epochs should I train with, what batch size, etc.?
For the YOLO training command, I mean.
Thanks for everything, btw.
Is it better to train with PyTorch + CPU or PyTorch + GPU if I'm going to export the model to ONNX format?
You can try 50 epochs.
For the batch size, you can set -1 to use AutoBatch (docs.ultralytics.com/reference/yol...).
A GPU only increases training speed. For the ONNX export, it does not matter which you use; the result will be the same.
Hi,
Thank you for your article, it has been a very big help to my project.
I am using a Python server to run my HTML page.
I have downloaded the ort-wasm-simd.wasm file from the link you provided in an earlier reply and added it to the same directory as my index.html file, but I am still getting the errors that I have attached herewith.
I have also imported ort.min.js using the importScripts function in my worker.js file.
Could you please help me solve this problem?
Yes, this is a common problem. It means that 'ort.min.js' and 'ort-wasm-simd.wasm' are from different versions of the ONNX Runtime.
You need to download and use both files from the same version. When the article was written it was 1.15.1, but now it is 1.17.1.
Download both files from the latest ONNX Runtime distribution:
cdn.jsdelivr.net/npm/onnxruntime-w...
Include them and try again.
Hi, I included the latest files and that error has been resolved, but I am facing another error now.
Hi
Sorry, but I can't read the text of the error message. Can you add a bigger screenshot?
Sure
(drive.google.com/file/d/1akbX83N-s...)
The above error has been resolved (thank you for your help), but the code is drawing the boxes in random places.
I wanted to ask if this code will work for 800x800 images, since my ONNX file accepts input of 800x800.
The standard YOLOv8 model accepts 640x640 images, so the code resizes any image to 640x640 before processing and then scales the boxes back with this size in mind.
To make it work with an 800x800 size, you need to replace all occurrences of the number "640" with "800" in the "prepare_input", "run_model", and "process_output" functions.
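That substitution can also be factored into a single constant, so the input size only lives in one place. A minimal sketch of the box-rescaling part (MODEL_SIZE and scaleBox are illustrative names, not from the article):

```javascript
// Illustrative sketch: parameterize the model input size instead of
// hard-coding 640, so an 800x800 model only needs one change.
const MODEL_SIZE = 800; // 640 for the standard YOLOv8 export

// Convert one box from model coordinates (center x/y, width, height)
// back to corner coordinates in the original image.
function scaleBox(xc, yc, w, h, imgWidth, imgHeight) {
    return [
        (xc - w / 2) / MODEL_SIZE * imgWidth,
        (yc - h / 2) / MODEL_SIZE * imgHeight,
        (xc + w / 2) / MODEL_SIZE * imgWidth,
        (yc + h / 2) / MODEL_SIZE * imgHeight
    ];
}
```

The same constant would replace "640" in the resize step of prepare_input and in the input tensor shape passed to the model.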
Sorry, Django is out of scope here.
If the solution works standalone, then the problem is definitely on the Django end. Search for how to correctly integrate static web pages with JavaScript and external assets into Django.
After the model runs for a couple of minutes, this error is logged in the console and the model stops working until the page is refreshed.
This is the error it is throwing.
What do you have on line 12 of the "worker.js" ?
I haven't experienced this, but it seems to be a bug in some versions of onnxruntime-web that others have experienced with different models: github.com/xenova/transformers.js/...
Try updating ort-wasm-simd.wasm to the latest version and use the latest version of ort.min.js.
Hello, I've just followed all of your tutorial.
But I am getting this error:
I have just copied and pasted all your code, but I don't know why I get this error.
I'm running the project on Visual Studio Code, with Live Server extension.
I think the separate thread 'worker.js' is giving the error.
Can you help me solve it?
Hello,
This is a common error when importing the ONNX runtime from a worker. It can't download the required WASM file automatically.
Did you download the ort-wasm-simd.wasm file to the project folder?
cdn.jsdelivr.net/npm/onnxruntime-w...
Oh, I forgot:
I thought this line of code in worker.js was the only thing I needed: importScripts("cdn.jsdelivr.net/npm/onnxruntime-w...");
It's ok. This annoying issue is mentioned in the "Running the model in background thread" section, and the link to this file should also be there.
Btw, if my model was trained using PyTorch + GPU, is there going to be any problem?
No, if it's based on YOLOv8 and successfully exported to ONNX.
Okay, thanks.
Another question.
If my model has only 26 labels (it's an American Sign Language detector), do I also have to modify the following line and change 80 to 26?
const [class_id, prob] = [...Array(80).keys()] // THIS LINE
function process_output(output, img_width, img_height) {
    let boxes = [];
    for (let index = 0; index < 8400; index++) {
        const [class_id, prob] = [...Array(80).keys()] // THIS LINE
            .map(col => [col, output[8400 * (col + 4) + index]])
            .reduce((accum, item) => item[1] > accum[1] ? item : accum, [0, 0]);
        if (prob < 0.5) {
            continue;
        }
        const label = yolo_classes[class_id];
        const xc = output[index];
        const yc = output[8400 + index];
        const w = output[2 * 8400 + index];
        const h = output[3 * 8400 + index];
        const x1 = (xc - w / 2) / 640 * img_width;
        const y1 = (yc - h / 2) / 640 * img_height;
        const x2 = (xc + w / 2) / 640 * img_width;
        const y2 = (yc + h / 2) / 640 * img_height;
        boxes.push([x1, y1, x2, y2, label, prob]);
    }
    boxes = boxes.sort((box1, box2) => box2[5] - box1[5]);
    const result = [];
    while (boxes.length > 0) {
        result.push(boxes[0]);
        boxes = boxes.filter(box => iou(boxes[0], box) < 0.7 || boxes[0][4] !== box[4]);
    }
    return result;
}
Yes, you should.
Or you can replace it like here:
const [class_id, prob] = [...Array(yolo_classes.length).keys()]
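For completeness, the process_output snippet quoted above also calls an iou helper defined elsewhere in the article. A self-contained sketch of a standard intersection-over-union for boxes stored as [x1, y1, x2, y2, ...], matching how process_output builds them, could look like:

```javascript
// Sketch of a standard IoU for boxes in [x1, y1, x2, y2, ...] form.
function iou(box1, box2) {
    const [ax1, ay1, ax2, ay2] = box1;
    const [bx1, by1, bx2, by2] = box2;
    // Overlap rectangle (zero area if the boxes do not intersect)
    const ix = Math.max(0, Math.min(ax2, bx2) - Math.max(ax1, bx1));
    const iy = Math.max(0, Math.min(ay2, by2) - Math.max(ay1, by1));
    const inter = ix * iy;
    const union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter;
    return union > 0 ? inter / union : 0;
}
```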
@andreygermanov
Hi. I am facing this error can you please help me.
I have the same code in the index.html, worker.js, and object_detector.js files, and I also have the yolov8n.onnx file and the latest ort-wasm-simd.wasm file, but I am facing an issue regarding ort-wasm-simd-threaded.mjs:
http://127.0.0.1:5500/ort-wasm-simd-threaded.mjs
Hi,
It looks like Microsoft does not care about backward compatibility when creating new releases of the ONNX Runtime.
So, please manually download the ort-wasm-simd-threaded.mjs and ort-wasm-simd-threaded.wasm files from here: cdn.jsdelivr.net/npm/onnxruntime-w..., put them in the root of your project, and try again.
@andreygermanov
Hi, Hope you are doing well.
Thank you very, very much for answering my query. The issue was resolved and it is now working properly for me after adding these two files manually. You are a shining hero <3.
Best regards.
I already followed your steps and downloaded the code from the Google Drive link here (someone commented that this code runs OK):
drive.google.com/drive/folders/1FQ...
I changed the ONNX path.
I put it into a folder of XAMPP on my local server (localhost does not have HTTPS); when it runs, it shows a problem with WASM. How do I solve this, please?
Errors:
Hello,
The WASM file is outdated.
Please replace the ort-wasm-simd.wasm file with the one from here: cdn.jsdelivr.net/npm/onnxruntime-w... and try again.
Hello,
Is there a way that the detection starts from the first second the video is played?
I'm trying to build a warning system where, if a specific label is detected within the frame, a webpage alert will be displayed.
But because the detection doesn't start from the first second, the first few frames are not being processed.
Do you have any idea how i can make this work?
To capture each individual frame you can run the model inside "timeupdate" event handler of the video player, like here:
Also, you can repeat the same code inside "play" event handler to ensure that it captures the earliest frame right in a moment when it starts playing.
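Sketched in code, with runDetection standing in for the article's capture-and-infer logic (attachDetection and runDetection are illustrative names, not from the article):

```javascript
// Sketch: run detection both when playback starts ("play") and on every
// subsequent "timeupdate" tick, so the earliest frames are covered too.
function attachDetection(video, runDetection) {
    const handler = () => runDetection(video);
    video.addEventListener("play", handler);       // earliest possible frame
    video.addEventListener("timeupdate", handler); // following frames
    return handler;
}
```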
Thank you so much i really appreciate it
First of all, thank you for being so helpful. I have a problem. I downloaded your code and tried to run it with the web server extension in VS Code, but only the video works, with no detections. When I clicked "inspect element" in the browser, I got this error: Error: no available backend found. ERR: [wasm] RuntimeError: indirect call to null, [cpu] Error: previous call to 'initializeWebAssembly()' failed., [xnnpack] Error: previous call to 'initializeWebAssembly()' failed.
From time to time, Microsoft updates the ONNX runtime library without worrying about backward compatibility. This problem has already been discussed here before. To solve it, ensure that the version of "ort.min.js" that you import matches the version of the "ort-wasm-simd.wasm" binary that you downloaded. Do the following:
1. Download the latest ort-wasm-simd.wasm file from here: cdn.jsdelivr.net/npm/onnxruntime-w...
2. Ensure that you load the latest version of "ort.min.js". The first line of "worker.js" should be:
3. Perhaps you will need to restart the live server to apply these changes.
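The import in step 2, assuming the standard jsDelivr layout of the onnxruntime-web npm package (the pinned version below is only an example and must match the downloaded .wasm file), can look like:

```javascript
// worker.js, first line: pin ort.min.js to the same release as the
// ort-wasm-simd.wasm file in the project root (1.17.1 is an example).
importScripts("https://cdn.jsdelivr.net/npm/onnxruntime-web@1.17.1/dist/ort.min.js");
```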
Thank you so much, it worked! But I still have two problems. I tried using my webcam and it's very slow; how can we optimize it? Also, I want to count detections after they pass a line (I did that in Python with OpenCV when I was using Flask, before I saw your solution). How can I put that logic in your solution? Thank you.
Hi, I am not sure how you can dramatically increase the YOLOv8 inference speed without a GPU.
To count detections that passed a line, you need some simple geometry. If you have the coordinates of the detected box [x1, y1, x2, y2] for each frame, and you have the coordinates of the line [x1, y1, x2, y2], you can calculate the intersection and see whether the detected box passed it or not.
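As a simplified concrete variant of that geometry, assuming a horizontal counting line, you can track which side of the line each box center is on and count a crossing when the side flips between frames. All names here are illustrative, and real counting also needs to associate boxes between frames (i.e. tracking), which this sketch does not cover:

```javascript
// Boxes are stored as [x1, y1, x2, y2, ...], as in process_output.
function boxCenterY(box) {
    const [, y1, , y2] = box;
    return (y1 + y2) / 2;
}

// True when the box center moved from one side of the horizontal line
// at y = lineY to the other between two consecutive frames.
function crossedLine(prevBox, box, lineY) {
    const prevSide = Math.sign(boxCenterY(prevBox) - lineY);
    const side = Math.sign(boxCenterY(box) - lineY);
    return prevSide !== 0 && side !== 0 && prevSide !== side;
}
```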
First of all, thank you very much for your tutorials. I've followed all of your tutorials and have some questions. I hope you can help me clarify.
In your tutorial, you use the canvas element to replace the video element. Each prediction, the canvas simultaneously draws the current video frame and bounding boxes. In my project, I still use the video element to display the video, with the canvas overlaying on the video element for drawing. This way, video controls are retained. Would the latter approach be better since the canvas doesn't need to draw the image, only the bounding boxes?
In the official Canvas documentation, OffscreenCanvas and workers enable rendering operations to run in a separate thread, avoiding heavy work on the main thread. Therefore, moving the run_model function and the bounding-box drawing into a worker should further enhance performance.
In the run_model function, you reload the model for every prediction. Moving the model loading outside the detection loop should significantly improve speed. In my code, loading an ONNX-format model takes about 400ms. I don't know why you reload the model every time, yet your real-time detection performance still remains good.
I trained a custom dataset using the pre-trained YOLOv8m model and obtained a best.pt model file with a size of 49MB. After converting it to an ONNX model, the file size increased to 98MB. However, my custom model takes over 4000ms to predict an image, which is insufficient for real-time detection tasks. I'm curious how many milliseconds it takes for you to predict an image and why my prediction time is so long. My two devices, an M1 MacBook Air and an Arch Linux machine with an i7-12700 processor, both exhibit inference times exceeding 4000ms.
const video = document.querySelector("video");
video.addEventListener("loadeddata", () => {
    console.log(video.videoWidth, video.videoHeight);
});
After I run this piece of code, I get an error that says: ReferenceError: document is not defined
at Object. (c:\Users\Manvi\primevision2\yolov8_inference_video_javascript\object_detector.js:1:15)
at Module._compile (node:internal/modules/cjs/loader:1358:14)
at Module._extensions..js (node:internal/modules/cjs/loader:1416:10)
at Module.load (node:internal/modules/cjs/loader:1208:32)
at Module._load (node:internal/modules/cjs/loader:1024:12)
at Function.executeUserEntryPoint as runMain
at node:internal/main/run_main_module:28:49
Can anyone please help?
Do you run this in a web browser or in Node.js ?
This code should be run in a web browser.
Also, sometimes it can happen in frameworks like Next.js. In this case it's a framework-related issue. Try searching for "document is not defined" or "document is not an object".
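A quick way to confirm the environment (and to fail gracefully during server-side rendering) is to guard on the global, since document exists only in a browser:

```javascript
// "document" exists only in a browser; in Node.js (and during Next.js
// server-side rendering) it is undefined, hence the ReferenceError.
function hasDOM() {
    return typeof document !== "undefined";
}

if (hasDOM()) {
    // Safe to touch the page here, e.g. document.querySelector("video")
}
```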
Hello, I read on the ONNX web page that with onnxruntime-web we can use WebGL or WASM.
In your project you're using WASM. Do you know how I can use WebGL for pre-processing?
I did not use it in practice, because WebGL is not stable enough; it does not support all operators. It did not work with the YOLOv8 model when I tried.
In general, you can try it when constructing the model this way:
const model = await ort.InferenceSession.create('yolov8n.onnx',
    {
        executionProviders: ['webgl']
    }
);
With the same .wasm file?
The .wasm file is not required for WebGL.
I also face an error when using WebGL. Don't know how to solve it :)
No, the YOLOv8 model has operators that are not supported in the ONNX WebGL implementation (at least in the current version).
I have a DevExpress Blazor web app, and I have put my ONNX model in my "wwwroot/models" folder.
It is not able to find or detect the path to the ONNX model.
error = Error: failed to load external data file: models/sevensegment.onnx
at gn (cdn.jsdelivr.net/npm/onnxruntime-w...)
at async Co.fetchModelAndCopyToWasmMemory (cdn.jsdelivr.net/npm/onnxruntime-w......
Before moving to Blazor I implemented your code and it was working, but I am stuck in my Blazor project: why can't the ONNX model be found?
(async () => {
    try {
        const modelPath = "wwwroot/models/sevensegment.onnx"; // Ensure this path is correct
        const session = await ort.InferenceSession.create(modelPath);
        console.log("Model loaded successfully");
    } catch (e) {
        console.error("Failed to load model:", e);
    }
})();
By the way, have you worked with a tfjs-format model exported from a YOLOv8 model?
I don't know how to interpret the output tensor I got.
That's because onnx2tf exports all the info (bbox, score, class, ...) as just one tensor (not an array of tensors), so I'm unable to read it; I can't understand it.
No, haven't worked with it.
Hi. I've tried this, but it doesn't work :( Here is the error. Can you help, please?
@durdur, same error on my side as well. Do you have any idea how to resolve it? Any solution? Thanks.
Download the ort-wasm-simd-threaded.mjs and ort-wasm-simd-threaded.wasm files from here: cdn.jsdelivr.net/npm/onnxruntime-w..., put them in the root of your project, and try again.
The bounding boxes are being drawn with a delay; with moving objects there is a lag in the creation of the boxes. How can I achieve faster detection on frames?