Skip to content
View imoneoi's full-sized avatar
🎯
Tuning PPO
🎯
Tuning PPO

Organizations

@OpenOrca @FastEval

Block or report imoneoi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. openchat openchat Public

    OpenChat: Advancing Open-source Language Models with Imperfect Data

    Python 5.3k 401

  2. openchat-ui openchat-ui Public

    Forked from mckaywrigley/chatbot-ui

    An open source UI for OpenChat models

    TypeScript 262 55

  3. multipack_sampler multipack_sampler Public

    Multipack distributed sampler for fast padding-free training of LLMs

    Python 182 13

  4. onerl onerl Public

    One RL Platform is all you need -- Event-driven fully distributed reinforcement learning framework

    Python 18 4

  5. EvolvingConnectivity EvolvingConnectivity Public

    Code for paper Evolving Connectivity for Spiking Neural Networks

    Python 16 3

  6. autonomous_driving_mpc autonomous_driving_mpc Public

    Model Predictive Controller for Autonomous Driving implemented using ROS and C++

    C++ 87 30