Skip to content

Commit

Permalink
Module Docs (#2620)
Browse files Browse the repository at this point in the history
* update bert docs

* update based

* update bigcode

* add pixtral

* add flux as well
  • Loading branch information
zachcp authored Nov 16, 2024
1 parent 00d8a0c commit a3f200e
Show file tree
Hide file tree
Showing 5 changed files with 126 additions and 10 deletions.
6 changes: 3 additions & 3 deletions candle-transformers/src/models/based.rs
Original file line number Diff line number Diff line change
@@ -1,9 +1,9 @@
//! Based from the Stanford Hazy Research group.
//!
//! See "Simple linear attention language models balance the recall-throughput tradeoff", Arora et al. 2024
//! - [Arxiv](https://arxiv.org/abs/2402.18668)
//! - [Github](https://github.com/HazyResearch/based)
//!
//! - Simple linear attention language models balance the recall-throughput tradeoff. [Arxiv](https://arxiv.org/abs/2402.18668)
//! - [Github Rep](https://github.com/HazyResearch/based)
//! - [Blogpost](https://hazyresearch.stanford.edu/blog/2024-03-03-based)
use candle::{DType, Device, IndexOp, Module, Result, Tensor, D};
use candle_nn::{
Expand Down
59 changes: 56 additions & 3 deletions candle-transformers/src/models/bert.rs
Original file line number Diff line number Diff line change
@@ -1,8 +1,61 @@
//! BERT (Bidirectional Encoder Representations from Transformers)
//!
//! See "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding", Devlin et al. 2018
//! - [Arxiv](https://arxiv.org/abs/1810.04805)
//! - [Github](https://github.com/google-research/bert)
//! Bert is a general large language model that can be used for various language tasks:
//! - Compute sentence embeddings for a prompt.
//! - Compute similarities between a set of sentences.
//! - [Arxiv](https://arxiv.org/abs/1810.04805) "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
//! - Upstream [Github repo](https://github.com/google-research/bert).
//! - See bert in [candle-examples](https://github.com/huggingface/candle/tree/main/candle-examples/) for runnable code
//!
//! ```no_run
//! // for sentence embeddings
//! # use candle_core::Tensor;

Check failure on line 12 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

unresolved import `candle_core`
//! # use candle_nn::{VarBuilder, Module};
//! # fn main() -> candle_core::Result<()> {
//! # let model = todo!();
//! # let prompt = "Here is a test sentence";
//! let embeddings = model.forward(prompt)?;
//! // Returns tensor of shape [1, 7, 384]
//! println!("{embeddings}");
//! # Ok(())
//! # }
//!
//! // Different models can be loaded using the model ID
//! # use candle_core::Tensor;
//! # use candle_nn::{VarBuilder, Module};

Check failure on line 25 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `VarBuilder` is defined multiple times

Check failure on line 25 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `Module` is defined multiple times
//! # fn main() -> candle_core::Result<()> {

Check failure on line 26 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `main` is defined multiple times
//! # let vb = todo!();
//! # let config = todo!();
//! let model = BertModel::load(vb, &config )?;
//! # Ok(())
//! # }
//!
//! // Gelu approximation
//! // You can get a speedup by configuring the model
//! // to use an approximation of the gelu activation:
//! # use candle_core::Tensor;
//! # use candle_nn::{VarBuilder, Module};

Check failure on line 37 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `VarBuilder` is defined multiple times

Check failure on line 37 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `Module` is defined multiple times
//! # fn main() -> candle_core::Result<()> {

Check failure on line 38 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `main` is defined multiple times
//! # let mut config = todo!();
//! config.hidden_act = HiddenAct::GeluApproximate;
//! # Ok(())
//! # }
//!
//! // Similarities
//! // Bert can compute sentence embeddings which can then be used to calculate
//! // semantic similarities between sentences through cosine similarity scoring.
//! // The sentence embeddings are computed using average pooling across all tokens.
//! # use candle_core::Tensor;
//! # use candle_nn::{VarBuilder, Module};

Check failure on line 49 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `VarBuilder` is defined multiple times

Check failure on line 49 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `Module` is defined multiple times
//! # fn main() -> candle_core::Result<()> {

Check failure on line 50 in candle-transformers/src/models/bert.rs

View workflow job for this annotation

GitHub Actions / Test Suite (macOS-latest, stable)

the name `main` is defined multiple times
//! # let model = todo!();
//! let sentence1 = "The new movie is awesome";
//! let sentence2 = "The new movie is so great";
//! let emb1 = model.forward(sentence1)?;
//! let emb2 = model.forward(sentence2)?;
//! # Ok(())
//! # }
//! ```
//!
use super::with_tracing::{layer_norm, linear, LayerNorm, Linear};
use candle::{DType, Device, Result, Tensor};
Expand Down
18 changes: 17 additions & 1 deletion candle-transformers/src/models/bigcode.rs
Original file line number Diff line number Diff line change
@@ -1,9 +1,25 @@
//! BigCode implementation in Rust based on the GPT-BigCode model.
//!
//! See "StarCoder: A State-of-the-Art LLM for Code", Mukherjee et al. 2023
//! [StarCoder/BigCode](https://huggingface.co/bigcode/starcoderbase-1b) is a LLM
//! model specialized to code generation. The initial model was trained on 80
//! programming languages. See "StarCoder: A State-of-the-Art LLM for Code", Mukherjee et al. 2023
//! - [Arxiv](https://arxiv.org/abs/2305.06161)
//! - [Github](https://github.com/bigcode-project/starcoder)
//!
//! ## Running some example
//!
//! ```bash
//! cargo run --example bigcode --release -- --prompt "fn fact(n: u64) -> u64"
//!
//! > fn fact(n: u64) -> u64 {
//! > if n == 0 {
//! > 1
//! > } else {
//! > n * fact(n - 1)
//! > }
//! > }
//! ```
//!
use candle::{DType, Device, IndexOp, Result, Tensor, D};
use candle_nn::{embedding, linear_b as linear, Embedding, LayerNorm, Linear, Module, VarBuilder};
Expand Down
22 changes: 19 additions & 3 deletions candle-transformers/src/models/flux/mod.rs
Original file line number Diff line number Diff line change
@@ -1,10 +1,26 @@
//! Flux Model
//!
//! Flux is a series of text-to-image generation models based on diffusion transformers.
//! Flux is a 12B rectified flow transformer capable of generating images from text descriptions.
//!
//! - [GH Link](https://github.com/black-forest-labs/flux)
//! - Transformers Python [reference implementation](https://github.com/huggingface/transformers/blob/5af7d41e49bbfc8319f462eb45253dcb3863dfb7/src/transformers/models/chinese_clip/modeling_chinese_clip.py)
//! - [Hugging Face Model](https://huggingface.co/black-forest-labs/FLUX.1-schnell)
//! - [GitHub Repository](https://github.com/black-forest-labs/flux)
//! - [Blog Post](https://blackforestlabs.ai/announcing-black-forest-labs/)
//!
//! # Usage
//!
//! ```bash
//! cargo run --features cuda \
//! --example flux -r -- \
//! --height 1024 --width 1024 \
//! --prompt "a rusty robot walking on a beach holding a small torch, \
//! the robot has the word \"rust\" written on it, high quality, 4k"
//! ```
//!
//! <div align=center>
//! <img src="https://github.com/huggingface/candle/raw/main/candle-examples/examples/flux/assets/flux-robot.jpg" alt="" width=320>
//! </div>
//!
use candle::{Result, Tensor};

pub trait WithForward {
Expand Down
31 changes: 31 additions & 0 deletions candle-transformers/src/models/pixtral/mod.rs
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,38 @@
//! using images paired with text descriptions.
//!
//! - Transformers Python [reference implementation](https://github.com/huggingface/transformers/tree/main/src/transformers/models/pixtral)
//! - [Blog Post](https://mistral.ai/news/pixtral-12b/) -
//! - [HF Model Card](https://huggingface.co/mistralai/Pixtral-12B-2409) -
//! - [HF Community Model Card](https://huggingface.co/mistral-community/pixtral-12b).
//!
//! # Example
//!
//! <div align=center>
//! <img src="https://github.com/huggingface/candle/raw/main/candle-examples/examples/flux/assets/flux-robot.jpg" alt="" width=320>
//! </div>
//!
//! ```bash
//! cargo run --profile=release-with-debug \
//! --features cuda \
//! --example pixtral -- \
//! --image candle-examples/examples/flux/assets/flux-robot.jpg
//! ```
//!
//! ```txt
//! Describe the image.
//!
//! The image depicts a charming, rustic robot standing on a sandy beach at sunset.
//! The robot has a vintage, steampunk aesthetic with visible gears and mechanical
//! parts. It is holding a small lantern in one hand, which emits a warm glow, and
//! its other arm is extended forward as if reaching out or guiding the way. The
//! robot's body is adorned with the word "RUST" in bright orange letters, adding to
//! its rustic theme.
//!
//! The background features a dramatic sky filled with clouds, illuminated by the
//! setting sun, casting a golden hue over the scene. Gentle waves lap against the
//! shore, creating a serene and picturesque atmosphere. The overall mood of the
//! image is whimsical and nostalgic, evoking a sense of adventure and tranquility.
//! ```
pub mod llava;
pub mod vision_model;
Expand Down

0 comments on commit a3f200e

Please sign in to comment.