Rework regularized layers #73
Conversation
- Only one struct named `Regularized`; every regularized layer is a particular case of it
- Specific constructors for `SparseArgmax`, `SoftArgmax`, and `RegularizedFrankWolfe`
- Now we can also use `Regularized` with a custom optimizer (we may need to test this feature)
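The unified design can be sketched as follows (a Python illustration with hypothetical names, not the actual InferOpt.jl API, which is written in Julia): one `Regularized` container holds the regularizer Ω and an optimizer solving `θ ↦ argmax_y θᵀy - Ω(y)`, and `SoftArgmax` is just a particular instance where the regularized problem has the softmax as closed-form solution.

```python
import math

class Regularized:
    """Regularized layer: theta -> argmax_y theta^T y - Omega(y).

    Holds the regularizer Omega and an optimizer that solves the
    regularized problem. (Illustrative sketch, not the InferOpt API.)
    """
    def __init__(self, omega, optimizer):
        self.omega = omega          # the regularizer Omega
        self.optimizer = optimizer  # solves argmax theta^T y - Omega(y)

    def __call__(self, theta):
        return self.optimizer(theta)

def _softmax(theta):
    # Closed-form optimizer when Omega is the negative entropy.
    m = max(theta)
    exps = [math.exp(t - m) for t in theta]
    s = sum(exps)
    return [e / s for e in exps]

def _neg_entropy(y):
    return sum(p * math.log(p) for p in y if p > 0)

# "SoftArgmax" becomes a particular instance of Regularized:
soft_argmax_layer = Regularized(_neg_entropy, _softmax)
probs = soft_argmax_layer([1.0, 2.0, 3.0])
```

A `SparseArgmax` instance would follow the same pattern with a quadratic Ω and a simplex-projection optimizer, and a custom optimizer can be plugged in without defining a new struct.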
Codecov Report

@@            Coverage Diff             @@
##             main      #73      +/-  ##
==========================================
- Coverage   80.57%   80.33%   -0.25%
==========================================
  Files          19       20       +1
  Lines        345      356      +11
==========================================
+ Hits         278      286       +8
- Misses        67       70       +3

View full report in Codecov by Sentry.
@@ -32,30 +32,14 @@ Some values you can tune:

  See the documentation of FrankWolfe.jl for details.
  """
- struct RegularizedGeneric{M,RF,RG,FWK}
+ struct FrankWolfeOptimizer{M,RF,RG,FWK}
      maximizer::M
I would rather call this FrankWolfeConcaveMaximizer
  """
  struct Regularized{O,R}
      Ω::R
      optimizer::O
I would rather call this `concave_maximizer`, to differentiate it from the `(linear_)maximizer` used elsewhere.
  TODO
  """
  function RegularizedFrankWolfe(linear_maximizer, Ω, Ω_grad, frank_wolfe_kwargs=NamedTuple())
      # TODO: add a warning if DifferentiableFrankWolfe is not imported?
Good idea
@@ -9,7 +9,7 @@ Relies on the Frank-Wolfe algorithm to minimize a concave objective on a polytope

  Since this is a conditional dependency, you need to run `import DifferentiableFrankWolfe` before using `RegularizedGeneric`.

  # Fields
- - `maximizer::M`: linear maximization oracle `θ -> argmax_{x ∈ C} θᵀx`, implicitly defines the polytope `C`
+ - `linear_maximizer::M`: linear maximization oracle `θ -> argmax_{x ∈ C} θᵀx`, implicitly defines the polytope `C`
Maybe we should use `linear_maximizer` throughout InferOpt?
  """
      optimizer: θ ⟼ argmax θᵀy - Ω(y)
  """
  struct Regularized{O,R}
Do we also need the linear maximizer as a field, for when the layer is called outside of training? It would make sense to me to modify the behavior of `Perturbed` as well, so that the standard forward pass just calls the naked linear maximizer.
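The proposed behavior could look roughly like this (a Python sketch with hypothetical names and a Monte-Carlo perturbation, for illustration only): the layer smooths the combinatorial oracle during training, but the standard forward pass just calls the naked linear maximizer.

```python
import random

def linear_maximizer(theta):
    # One-hot argmax: the "naked" combinatorial oracle.
    i = max(range(len(theta)), key=lambda j: theta[j])
    return [1.0 if j == i else 0.0 for j in range(len(theta))]

class Perturbed:
    """Perturbed layer: smoothed in training, exact argmax at inference."""
    def __init__(self, maximizer, nb_samples=100, epsilon=1.0, seed=0):
        self.maximizer = maximizer
        self.nb_samples = nb_samples
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def __call__(self, theta, training=False):
        if not training:
            # Standard forward pass: just call the naked linear maximizer.
            return self.maximizer(theta)
        # Training pass: average the oracle over Gaussian perturbations.
        n = len(theta)
        acc = [0.0] * n
        for _ in range(self.nb_samples):
            z = [self.rng.gauss(0.0, 1.0) for _ in range(n)]
            y = self.maximizer([t + self.epsilon * zi for t, zi in zip(theta, z)])
            acc = [a + yi for a, yi in zip(acc, y)]
        return [a / self.nb_samples for a in acc]

layer = Perturbed(linear_maximizer)
hard = layer([0.2, 1.5, -0.3])                  # one-hot at inference
soft = layer([0.2, 1.5, -0.3], training=True)   # smoothed during training
```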
@@ -10,8 +10,12 @@ function soft_argmax(z::AbstractVector; kwargs...)
      return s
  end

- @traitimpl IsRegularized{typeof(soft_argmax)}
+ # @traitimpl IsRegularized{typeof(soft_argmax)}
In the trash
@@ -10,11 +10,15 @@ function sparse_argmax(z::AbstractVector; kwargs...)
      return p
  end

- @traitimpl IsRegularized{typeof(sparse_argmax)}
+ # @traitimpl IsRegularized{typeof(sparse_argmax)}
In the trash
What do you have in mind? I think we can use a basic QP solver from JuMP, or write our own with FISTA.
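For the quadratic regularizer, no external QP solver is strictly needed: `sparse_argmax(θ)` solves `argmax_y θᵀy - ½‖y‖²` over the simplex, i.e. the Euclidean projection of `θ` onto the simplex, which has a well-known O(n log n) closed form. A Python sketch (for illustration; the actual implementation would be in Julia):

```python
def simplex_projection(theta):
    """Euclidean projection of theta onto the probability simplex.

    Solves argmax_y theta^T y - 0.5 * ||y||^2 over the simplex
    (the QP behind sparse_argmax) via the sort-based closed form.
    """
    n = len(theta)
    u = sorted(theta, reverse=True)
    cssv = 0.0
    lam = 0.0
    for i in range(n):
        cssv += u[i]
        t = (cssv - 1.0) / (i + 1)
        if u[i] - t > 0:
            lam = t  # last threshold with a positive coordinate
    # Shift by the threshold and clip at zero: the result is sparse.
    return [max(v - lam, 0.0) for v in theta]

p = simplex_projection([0.0, 1.1, 0.3])  # sparse: first coordinate is zeroed
```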
Every regularized layer is now handled by the `Regularized` struct, and not under the `IsRegularized` trait anymore (partially addresses "Get rid of SimpleTraits? #68"). Every regularized layer is now a particular instance of `Regularized`, with specific constructors for `SparseArgmax`, `SoftArgmax`, and `RegularizedFrankWolfe`. Now we can also use `Regularized` with a custom optimizer (addresses "Other solvers than FW for RegularizedGeneric #62").

TODO:
- Test `Regularized` with a custom optimizer
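A minimal check of the custom-optimizer path might compare an iterative optimizer against a known closed form (a Python sketch with assumed names; the real test would be in Julia). Here a mirror-ascent optimizer for the entropy-regularized problem should recover the softmax:

```python
import math

def softmax(theta):
    # Closed-form solution of argmax theta^T y + H(y) on the simplex.
    m = max(theta)
    e = [math.exp(t - m) for t in theta]
    s = sum(e)
    return [x / s for x in e]

def custom_optimizer(theta, steps=2000, lr=0.1):
    """Entropic mirror ascent on the simplex for argmax theta^T y + H(y)."""
    y = [1.0 / len(theta)] * len(theta)
    for _ in range(steps):
        # Gradient of theta^T y - sum y log y is theta - log y - 1.
        g = [t - math.log(p) - 1.0 for t, p in zip(theta, y)]
        # Multiplicative-weights update, then renormalize onto the simplex.
        w = [p * math.exp(lr * gi) for p, gi in zip(y, g)]
        s = sum(w)
        y = [x / s for x in w]
    return y

theta = [0.5, -1.0, 2.0]
y_custom = custom_optimizer(theta)   # iterative "custom optimizer"
y_closed = softmax(theta)            # closed-form reference
```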