<small>How to Write Video Plugins</small> === <!-- .slide: data-background-color="pink" --> <!-- .slide: data-transition="zoom" --> Expand the Power of Open Source Programmatic Video Manipulating Tools > [name=郭學聰 Hsueh-Tsung Kuo] [color=red] > [time=Sun, 04 Aug 2024] ###### CC BY-SA 4.0 --- <!-- .slide: data-transition="convex" --> ## Who am I? ![fieliapm](https://www.gravatar.com/avatar/2aef78f04240a6ac9ccd473ba1cbd1e3?size=2048 =384x384) <small>Someone (who?) said: a game programmer should be able to draw cute anime character(?)</small> ---- <!-- .slide: data-transition="convex" --> * A ~~programmer~~ coding peasant from game company in Taiwan. * Backend (and temporary frontend) engineer. * Usually develop something related to my work in Python, Ruby, ECMAScript, Golang, C#. * Built CDN-aware game asset update system. * Business large passenger vehicle driver. :bus: * Ride bike to help traffic jam. :racing_motorcycle: * Care about chaotic traffic in Taiwan. * Draw cute anime character in spare time. --- <!-- .slide: data-transition="convex" --> ## Outline ---- <!-- .slide: data-transition="convex" --> 4. Programmatic Video Manipulating Tools * Showcase * Structure * How to serve video frames? * Execution ---- <!-- .slide: data-transition="convex" --> 5. Comparison * AviSynth * History * Feature * AviSynthPlus * Feature * VapourSynth * Origin * Feature * Change ---- <!-- .slide: data-transition="convex" --> 6. Make Plugins * Build Environment * AviSynth &amp; AviSynthPlus * Script Usage * Plugins * Parallelism * VapourSynth * Script Usage * Plugins * Parallelism ---- <!-- .slide: data-transition="convex" --> 7. Design * Convention * API Interface Changed * Parallelism &amp; Async 8. Conclusion 9. Resource 10. Q&A --- <!-- .slide: data-transition="convex" --> ## Programmatic Video Manipulating Tools Non-linear video editor controlled entirely by scripting ---- <!-- .slide: data-transition="convex" --> ## Programmatic Video Manipulating Tools * Write script to describe how to edit video * Build filter graph * Serve video frames on the fly ---- <!-- .slide: data-transition="convex" --> ### Showcase ---- <!-- .slide: data-transition="convex" --> ### Structure ```graphviz digraph structure { nodesep=0.3 node [fontname=Courier,shape=box] edge [color=purple] subgraph cluster_AviSynth{ label="AviSynth" AviSynthCore AVSEffectFilter AVSSourceFilter } subgraph cluster_VapourSynth{ label="VapourSynth" VapourSynthCore VSEffectFilter VSSourceFilter } AviSynthCore [shape=component] AVSEffectFilter [shape=component] AVSSourceFilter [shape=component] VapourSynthCore [shape=component] VSEffectFilter [shape=component] VSSourceFilter [shape=component] VfW [label="<f0>VfW|<f1>DirectShow",shape=record] VideoPlayer [label="<f0>VideoPlayer|<f1>VideoEditor",shape=record] CLI->{AviSynthCore VapourSynthCore} [color=red] VideoPlayer->{VfW FileSystem} {VfW FileSystem}->{AviSynthCore VapourSynthCore} AviSynthCore->AVSEffectFilter AVSEffectFilter->AVSSourceFilter VapourSynthCore->VSEffectFilter VSEffectFilter->VSSourceFilter {rank=same;VideoPlayer CLI} {rank=same;FileSystem VfW} } ``` ---- <!-- .slide: data-transition="convex" --> ### Execution * Loading * Register functions * Parse the script * Build the filter graph * Runtime * Frame serving ---- <!-- .slide: data-transition="convex" --> ### Execution ```flow= st=>start: Open avs/vpy e=>end: Quit regfunc=>operation: Register functions parse=>operation: Parse the script build=>operation: Build the filter graph serve_frame=>subroutine: Serve frames event=>condition: Event release=>operation: Release resource st->regfunc(right)->parse(right)->build->event event(no@Close)->release->e event(yes@Frame request)->serve_frame(left)->event ``` ---- <!-- .slide: data-transition="convex" --> #### How to serve video frames? * Open it as video file * Hook AVIFile API * AVIFileOpen() * Hook file system * AVFS: ~~http://www.turtlewar.org/avfs/~~ ---- <!-- .slide: data-transition="convex" --> #### How to serve video frames? * Get frames by accessing API directly * Pipe CLI * avs2yuv (AviSynth) * VSPipe (VapourSynth) * Dedicated media framework filter * DirectShow * <small>https://github.com/CrendKing/avisynth_filter</small> * FFmpeg (AviSynth &amp; VapourSynth) * GStreamer --- <!-- .slide: data-transition="convex" --> ## Comparison ---- <!-- .slide: data-transition="convex" --> ### AviSynth &amp; AviSynthPlus http://avisynth.nl/index.php/Main_Page ---- <!-- .slide: data-transition="convex" --> ### AviSynth - History * Developed by Ben Rudiak-Gould, Edwin van Eggelen, Klaus Post, Richard Berg and Ian Brabham in May 2000 * Picked up and maintained by the open source community which is still active nowadays * 2.5.x * Old school * 2.6.x * Enhanced API * New plugins framework * add 10bit 16bit support ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Feature * Single-threaded * Dedicated script language * Trim video &amp; audio together * Filters work on raw video frame data * BGR interleaved(packed) * DWORD aligned lines, need pitch info * YUV planar ---- <!-- .slide: data-transition="convex" --> ### AviSynthPlus - Feature * Default single-threaded * Use Prefetch() will get frames from its child node using multiple threads * Dedicated script language * Trim video &amp; audio together * Filters work on raw video frame data * BGR interleaved(packed) * DWORD aligned lines, need pitch info * YUV planar * Support 10bit 16bit ---- <!-- .slide: data-transition="convex" --> ### VapourSynth * https://github.com/vapoursynth/vapoursynth * https://www.vapoursynth.com/ ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Origin * Written by Fredrik Mellbin (myrsloik) between jobs * Inspired by AviSynth ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Feature * Always multi-threaded * Dispatch a thread for every frame context * Parallel between frames and nodes * Python script * Generalized colorspaces * Filters work on color channel planes * one plane = one RGB/YUV channel * Need stride info between lines in a plane * Per frame properties * Support for video with format change * High performance &amp; efficient memory usage ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Change * Since R55 * API v3.x :arrow_right: API v4 * Script function breaking change * Add audio support * But video and audio cannot be trimmed together ```python= # Assume NTSC standard framerate and 48kHz as default def framesToSamples(frameNum, framerate=29.97, samplerate=48000): return math.floor((samplerate/framerate)*frameNum) video = video[71:217640] audio = audio[framesToSamples(71):framesToSamples(217640)] ``` --- <!-- .slide: data-transition="convex" --> ## Make Plugins ---- <!-- .slide: data-transition="convex" --> ### Build Environment * Native * Visual Studio (Community Edition) * Porting * GCC * MinGW ---- <!-- .slide: data-transition="convex" --> ### AviSynth &amp; AviSynthPlus ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Script Usage ```clike= video = AVISource("somevideo.avi") audio = WAVSource("music.wav") AudioDub(video, audio) # `last` will be audio dubbed result video Trim(0, 12000) ++ Trim(20000, 32000) ++ Trim(44000, 0) LanczosResize(320, 240) # AviSynth will serve the video in `last` ``` ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Plugins API v2.5 ```cpp= class IClip { friend class PClip; friend class AVSValue; int refcnt; void AddRef() { InterlockedIncrement((long *)&refcnt); } void Release() { InterlockedDecrement((long *)&refcnt); if (!refcnt) delete this; } public: IClip() : refcnt(0) {} virtual int __stdcall GetVersion() { return AVISYNTH_INTERFACE_VERSION; } virtual PVideoFrame __stdcall GetFrame(int n, IScriptEnvironment* env) = 0; virtual bool __stdcall GetParity(int n) = 0; // return field parity if field_based, else parity of first field in frame virtual void __stdcall GetAudio(void* buf, __int64 start, __int64 count, IScriptEnvironment* env) = 0; // start and count are in samples virtual void __stdcall SetCacheHints(int cachehints,int frame_range) = 0 ; // We do not pass cache requests upwards, only to the next filter. virtual const VideoInfo& __stdcall GetVideoInfo() = 0; virtual __stdcall ~IClip() {} }; typedef AVSValue (__cdecl *ApplyFunc)(AVSValue args, void* user_data, IScriptEnvironment* env); extern "C" __declspec(dllexport) const char* __stdcall AvisynthPluginInit2(IScriptEnvironment * env) ``` ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Plugins see AVSPlugin.cpp ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Parallelism * SetFilterMTMode() * Prefetch() will get frames from its child node using multiple threads ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Parallelism ```clike= SetFilterMTMode("DEFAULT_MT_MODE", 2) # or SetFilterMTMode("DEFAULT_MT_MODE", MT_MULTI_INSTANCE) SetFilterMTMode("FFVideoSource", 3) # or SetFilterMTMode("FFVideoSource", MT_SERIALIZED) FFVideoSource(...) Trim(...) QTGMC(...) ... # Enable MT! Prefetch(4) ``` ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Parallelism ```graphviz digraph structure { nodesep=1.0 node [fontname=Courier,shape=record] edge [color=purple] QTGMC [label="<0>QTGMC|<1>#1|<2>#2|<3>#3"] Trim [label="<0>Trim|<1>#1|<2>#2|<3>#3"] FFVideoSource [label="<0>FFVideoSource|<1>#101|<2>#102|<3>#103"] AviSynthCore->Prefetch Prefetch->QTGMC:1 Prefetch->QTGMC:2 Prefetch->QTGMC:3 QTGMC:1->Trim:1 QTGMC:2->Trim:2 QTGMC:3->Trim:3 Trim:1->FFVideoSource:1 Trim:2->FFVideoSource:2 [color=pink,style=dashed] Trim:3->FFVideoSource:3 [color=pink,style=dashed] } ``` ---- <!-- .slide: data-transition="convex" --> ### AviSynth - Parallelism Since AviSynthPlus 3.6 ```clike= # Filtering A Prefetch(1,4) # Filtering B Prefetch(4) # Filtering C Prefetch(1,4) ``` ---- <!-- .slide: data-transition="convex" --> ### VapourSynth ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Script Usage Until R54 ```python= from vapoursynth import core video = core.ffms2.Source(source='Rule6.mkv') video = core.std.Transpose(video) video.set_output() ``` ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Script Usage Since R55 ```python= from vapoursynth import core video = core.bs.VideoSource(source='filename.mkv') audio = core.bs.AudioSource(source='filename.mkv') video = core.std.FlipHorizontal(video) audio = core.std.AudioGain(audio,gain=2.0) video.set_output(index=0) audio.set_output(index=1) ``` ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Plugins API v3.x ```cpp= typedef void (VS_CC *VSFilterInit)(VSMap *in, VSMap *out, void **instanceData, VSNode *node, VSCore *core, const VSAPI *vsapi); typedef const VSFrameRef *(VS_CC *VSFilterGetFrame)(int n, int activationReason, void **instanceData, void **frameData, VSFrameContext *frameCtx, VSCore *core, const VSAPI *vsapi); typedef void (VS_CC *VSFilterFree)(void *instanceData, VSCore *core, const VSAPI *vsapi); typedef void (VS_CC *VSPublicFunction)(const VSMap *in, VSMap *out, void *userData, VSCore *core, const VSAPI *vsapi); VS_EXTERNAL_API(void) VapourSynthPluginInit(VSConfigPlugin configFunc, VSRegisterFunction registerFunc, VSPlugin* plugin) ``` ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Plugins see VSPlugin.cpp ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism * Call source filter GetFrame() once * Call non-source filter GetFrame() twice * activationReason = arInitial * vsapi->requestFrameFilter() * To queue the frame context into global task list * activationReason = arAllFramesReady * vsapi->getFrameFilter() * To get result frame from the frame context ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism * Thread pool will dispatch every task (frame context) * Locate the filter from the frame context * Trigger request/get frame of the filter * Store the frame to the frame context ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism ```python= import vapoursynth as vs video = vs.core.video_input_source.VideoInputSource(0, 'USB', 1280, 720) bilateral_video = vs.core.bilateral.Bilateral(video, sigmaS=20.0, sigmaR=0.1) bilateral_video.set_output() ``` ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism ```graphviz digraph structure { nodesep=1.0 node [fontname=Courier,shape=component] edge [color=purple] Output [shape=house] FrameContextQueue [shape=cylinder] Output->FrameContextQueue [color=red,style=dashed] FrameContextQueue->Bilateral [color=red,style=dashed] Bilateral->FrameContextQueue [color=blue,style=dashed] FrameContextQueue->VideoInputSource [color=blue,style=dashed] VideoInputSource->FrameContextQueue [color=purple] FrameContextQueue->Bilateral [color=purple] Bilateral->FrameContextQueue [color=orange] FrameContextQueue->Output [color=orange] } ``` red :arrow_right: blue :arrow_right: purple :arrow_right: orange ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism Implementation vapoursynth/src/core/vsthreadpool.cpp ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism # Cache implicitly inserted as parent node of every filter without flag *nfNoCache* at creating vapoursynth/src/cython/vapoursynth.pyx ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism * Spatial filter * fmParallel: directly get a frame * Temporal filter * FIR * fmParallel: directly get frames * Cache bless you :rolling_on_the_floor_laughing: * IIR * fmParallelRequests or fmUnordered ---- <!-- .slide: data-transition="convex" --> ### VapourSynth - Parallelism ```graphviz digraph structure { nodesep=1.0 node [fontname=Courier,shape=record] edge [color=purple] FIR [label="<0>FIR|<2>#2"] IIR [label="<0>IIR|<1>#1|<2>#2|<3>#3"] Child1 [label="<0>Child1|<1>#1|<2>#2|<3>#3"] Child2 [label="<0>Child2|<1>#1|<2>#2|<3>#3"] FIR:2->Child1:1 FIR:2->Child1:2 FIR:2->Child1:3 IIR:3->IIR:2 IIR:3->IIR:1 IIR:3->Child2:1 IIR:3->Child2:2 IIR:3->Child2:3 } ``` --- <!-- .slide: data-transition="convex" --> ## Design ---- <!-- .slide: data-transition="convex" --> ## Design Why? * AviSynth * IScriptEnvironment * VapourSynth * VSAPI * VSActivationReason ---- <!-- .slide: data-transition="convex" --> ### Convention * AviSynth * Register create() * create() return IClip * AviSynth C * Register create() * create() create a node with get_frame() * VapourSynth * Register create() * create() create a node with init(), getFrame(), free() ---- <!-- .slide: data-transition="convex" --> ### API Interface Changed * IScriptEnvironment * AviSynthPlus/avs_core/include/avisynth.h ```cpp= IScriptEnvironment* __stdcall CreateScriptEnvironment(int version = AVISYNTH_INTERFACE_VERSION); ``` ---- <!-- .slide: data-transition="convex" --> ### API Interface Changed * VSAPI * vapoursynth/include/VapourSynth.h ```cpp= VS_API(const VSAPI *) getVapourSynthAPI(int version) VS_NOEXCEPT; ``` ---- <!-- .slide: data-transition="convex" --> ### Parallelism &amp; Async ```cpp= typedef const VSFrameRef *(VS_CC *VSFilterGetFrame)(int n, int activationReason, void **instanceData, void **frameData, VSFrameContext *frameCtx, VSCore *core, const VSAPI *vsapi); ``` * activationReason * arInitial * arAllFramesReady ---- <!-- .slide: data-transition="convex" --> ### Parallelism &amp; Async async/await would be better? ```cpp= async const VSFrameRef* VS_CC VSFilterGetFrame(int n, int activationReason, void **instanceData, void **frameData, VSFrameContext *frameCtx, VSCore *core, const VSAPI *vsapi) { // ... const VSFrameRef *src = await vsapi->getFrameFilterAsync(n, d->node, frameCtx); // ... } ``` ---- <!-- .slide: data-transition="convex" --> ### Parallelism &amp; Async * Based on: * fiber framework * thread framework * native support in language * ... ---- <!-- .slide: data-transition="convex" --> ### Parallelism &amp; Async Ugly, but simple ```cpp= const VSFrameRef* VS_CC VSFilterGetFrame(int n, int activationReason, void **instanceData, void **frameData, VSFrameContext *frameCtx, VSCore *core, const VSAPI *vsapi) { if (activationReason == arInitial) { vsapi->requestFrameFilter(n, d->node, frameCtx); } else if (activationReason == arAllFramesReady) { const VSFrameRef *src = vsapi->getFrameFilter(n, d->node, frameCtx); // ... } } ``` --- <!-- .slide: data-transition="convex" --> ## Conclusion ---- <!-- .slide: data-transition="convex" --> ### Suggestion * Think design reasons * Watch the details of parallel behavior ---- <!-- .slide: data-transition="convex" --> ### Slogan :hash: {發揮你的駭客精神吧|<big>Use your hacker spirit</big>} > [name=郭學聰 Hsueh-Tsung Kuo] [time=2024_08_04] [color=red] :notebook: --- <!-- .slide: data-transition="convex" --> ## Resource ---- <!-- .slide: data-transition="convex" --> ### Reference * AviSynth wiki[^ref1] * VapourSynth Official[^ref2] * Doom9 forum[^ref3] [^ref1]:<small>http://avisynth.nl/index.php/Main_Page</small> [^ref2]:<small>https://www.vapoursynth.com/</small> [^ref3]:<small>https://forum.doom9.org/</small> --- <!-- .slide: data-transition="zoom" --> ## Q&A --- <style> .reveal { background: #FFDFEF; color: black; } .reveal h2, .reveal h3, .reveal h4, .reveal h5, .reveal h6 { color: black; } .reveal code { font-size: 18px !important; line-height: 1.2; } .progress div{ height:14px !important; background: hotpink !important; } // original template .rightpart{ float:right; width:50%; } .leftpart{ margin-right: 50% !important; height:50%; } .reveal section img { background:none; border:none; box-shadow:none; } p.blo { font-size: 50px !important; background:#B6BDBB; border:1px solid silver; display:inline-block; padding:0.5em 0.75em; border-radius: 10px; box-shadow: 5px 5px 5px #666; } p.blo1 { background: #c7c2bb; } p.blo2 { background: #b8c0c8; } p.blo3 { background: #c7cedd; } p.bloT { font-size: 60px !important; background:#B6BDD3; border:1px solid silver; display:inline-block; padding:0.5em 0.75em; border-radius: 8px; box-shadow: 1px 2px 5px #333; } p.bloA { background: #B6BDE3; } p.bloB { background: #E3BDB3; } /*.slide-number{ margin-bottom:10px !important; width:100%; text-align:center; font-size:25px !important; background-color:transparent !important; }*/ iframe.myclass{ width:100px; height:100px; bottom:0; left:0; position:fixed; border:none; z-index:99999; } h1.raw { color: #fff; background-image: linear-gradient(90deg,#f35626,#feab3a); -webkit-background-clip: text; -webkit-text-fill-color: transparent; animation: hue 5s infinite linear; } @keyframes hue { from { filter: hue-rotate(0deg); } to { filter: hue-rotate(360deg); } } .progress{ height:14px !important; } .progress span{ height:14px !important; background: url("") repeat-x !important; } .progress span:after, .progress span.nyancat{ content: ""; background: url('') !important; width: 34px !important; height: 21px !important; border: none !important; float:right; margin-top:-7px; margin-right:-10px; } </style>
{"title":"How to Write Video Plugins","description":"View the slide with \"Slide Mode\".","slideOptions":"{\"spotlight\":{\"enabled\":false},\"allottedMinutes\":25}","contributors":"[{\"id\":\"ea27dcd7-a3f2-47c2-b25e-6760e7936c38\",\"add\":62195,\"del\":36144,\"latestUpdatedAt\":null}]"}
    209 views