#

vision

Here are 1,748 public repositories matching this topic...

BVLC / caffe

Caffe: a fast open framework for deep learning.

machine-learning deep-learning vision

Updated Jul 31, 2024
C++

XTLS / Xray-core

Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens.

Updated Mar 5, 2025
Go

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active project.

Updated Mar 4, 2025
TypeScript

PaddleHub

PaddlePaddle / PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

nlp awesome deep-learning model vision text2image

Updated Aug 7, 2024
Python

mediar-ai / screenpipe

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

machine-learning ai computer-vision ml agi vision agents multimodal llm

Updated Mar 4, 2025
TypeScript

Skyvern-AI / skyvern

Automate browser-based workflows with LLMs and Computer Vision

python api workflow automation browser computer vision gpt browser-automation rpa playwright llm

Updated Mar 4, 2025
Python

mrousavy / react-native-vision-camera

📸 A powerful, high-performance React Native Camera library.

Updated Feb 27, 2025
Swift

Dooy / chatgpt-web-midjourney-proxy

One UI is all done with chatgpt web, midjourney, gpts,suno,luma,runway,viggle,flux,ideogram,realtime,pika,udio; Simultaneous support Web / PWA / Linux / Win / MacOS platform

flux realtime vision runway pika ideogram luma gpts midjourney chatgpt-ui midjourney-ui gptstore gpts-ui whisper-ui suno claude-3 udio viggle kling

Updated Feb 20, 2025
JavaScript

TEN-framework / TEN-Agent

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.

Updated Mar 4, 2025
Python

iOS-11-by-Examples

artemnovichkov / iOS-11-by-Examples

👨🏻‍💻 Examples of new iOS 11 APIs

swift vision xcode9 ios11 arkit coreml core-nfc

Updated Dec 31, 2021
Swift

donkeycar

autorope / donkeycar

Open source hardware and software platform to build a small scale self driving car.

python raspberry-pi tensorflow keras vision self-driving-car cv2 donkeycar jetson-nano

Updated Sep 15, 2024
Python

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

electron agent vision vlm vite gui-agents computer-use browser-use

Updated Mar 4, 2025
TypeScript

sightmachine / SimpleCV

The Open Source Framework for Machine Vision

python computer-vision cv image-processing vision visionprocessing

Updated Dec 20, 2024
Python

NextLevel / NextLevel

⬆️ Media Capture in Swift

swift ios instagram snapchat video camera custom photography augmented-reality ar media avfoundation capture vision nextlevel mixed-reality coreimage arkit tiktok

Updated Aug 12, 2024
Swift

GoogleCloudPlatform / java-docs-samples

Java and Kotlin Code samples used on cloud.google.com

kotlin java appengine video cdn auth samples vision translate automl

Updated Feb 27, 2025
Java

andyzeng / tsdf-fusion-python

Python code to fuse multiple RGB-D images into a TSDF voxel volume.

cuda artificial-intelligence vision rgbd 3d 3d-reconstruction depth-camera volumetric-data 3d-deep-learning tsdf kinect-fusion

Updated Feb 18, 2023
Python

roatienza / Deep-Learning-Experiments

Videos, notes and experiments to understand deep learning

nlp deep-learning speech pytorch artificial-intelligence vision deep-learning-tutorial

Updated Dec 15, 2024
Jupyter Notebook

KevinGong2013 / ChineseIDCardOCR

[Deprecated] 🇨🇳中国二代身份证光学识别

swift machine-learning deep-learning xcode cnn vision ios11 coreml

Updated Feb 7, 2018
Swift

OpenFind

aheze / OpenFind

An app to find text in real life.

swift photos ios app ocr camera uikit find realm vision hacktoberfest swiftui

Updated Feb 10, 2023
Swift

lucidrains / mlp-mixer-pytorch

An All-MLP solution for Vision, from Google AI

deep-learning vision

Updated Sep 13, 2024
Python

Improve this page

Add a description, image, and links to the vision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision topic, visit your repo's landing page and select "manage topics."