Unused variables are added to Direct solvers #1490

ruaridhw · 2020-06-08T12:36:17Z

In DirectOrPersistentSolver._add_block() all variables are looped over, added to the model, and then _referenced_variables is only computed after the fact as part of adding the constraints etc.

How does one deactivate variables? The only suggestion I can find is to fix them however this will still cause them to be added to the model unnecessarily. For very large models that are solved iteratively in a decomposition routine, this is very slow. In contrast, the (CPLEX) LP solver is very fast for models where only a small fraction of variables are relevant since it checks over _referenced_variable_ids.

If we cannot deactivate variables, how would we get around this? The only thing I can think of is a design where all the variable data to be sent to the Direct solver is cached until all the constraints, SOS, and objective(s) have also been computed, the variables are then filtered according to _referenced_variables and added, and finally the constraints, SOS, and objective(s) are added. This would follow on from the deferred logic of #1416.

The text was updated successfully, but these errors were encountered:

ghackebeil · 2020-06-08T12:51:58Z

This discussion is likely going to open up a big can of worms. In short, I have added the ability to deactivate variables to pyomo.kernel. I hope this can be pushed to the pyomo.environ interface to enable the kinds of performance improvements you mentioned.

michaelbynum · 2020-06-08T12:53:06Z

A couple comments.

The proposed approach could (I have not tested it, so I don't know for sure) slow down the addition of new constraints since each time add_constraint is called, you now have to ensure all of the variables used in the constraint are already in the CPLEX model.
I would find it slightly strange to have a model where a large potion of the variables are not used in constraints or the objective. Do you have an example application or algorithm where this is the case? In most cases, it should be relatively easy to control which variables go to the solver by controlling which blocks are active.

michaelbynum · 2020-06-08T12:53:46Z

@ghackebeil Makes a good point.

ruaridhw · 2020-06-08T14:18:25Z

The proposed approach could (I have not tested it, so I don't know for sure) slow down the addition of new constraints since each time add_constraint is called, you now have to ensure all of the variables used in the constraint are already in the CPLEX model.

I think this would be pretty fast considering you have access to _referenced_variables inside add_constraint() which is the Set of all of the VarData already added. It would be a quick set diff to compare the new variables to that.

I would find it slightly strange to have a model where a large potion of the variables are not used in constraints or the objective. Do you have an example application or algorithm where this is the case? In most cases, it should be relatively easy to control which variables go to the solver by controlling which blocks are active.

As an example, in the Job Shop scheduling problem it is typical to decompose the problem by "machine" even though many "job"s can go to multiple machines. This means the vast majority of the allocation variables of the jobs you are solving for are irrelevant (as they are for other machines).

Thanks, @ghackebeil! That's great to know.

michaelbynum · 2020-06-08T14:26:56Z

@ruaridhw You are probably right. The set difference is probably negligible, in which case I would be fine with the change you proposed.

ruaridhw · 2020-06-08T15:35:22Z

You're correct in that it will be (marginally) slower for incrementally adding constraints/variables. I guess I'm going for the approach of optimising the build of a full model because IMO for large applications, this is likely to be a limiting factor faster than incremental model changes.

The above approach would also avoid needing to deactivate variables as fixing them would have the same effect and ensure that they are never added to the solver_model.

michaelbynum · 2020-06-08T15:38:59Z

Fixed variables are trickier. When a variable is unfixed, you have to ensure that any constraints already added to the model that depend on that variable get updated. This is the primary reason fixed variables are added to the solver_model.

michaelbynum · 2020-06-08T15:43:05Z

I am currently working on refactoring the persistent solver interfaces. The refactor should take care of some of these issues (and adds some features like an automatic update mode), but it won't be ready for another year or so.

ruaridhw · 2020-06-09T15:42:01Z

Fixed variables are trickier. When a variable is unfixed, you have to ensure that any constraints already added to the model that depend on that variable get updated. This is the primary reason fixed variables are added to the solver_model.

Correct me if I'm wrong but I thought this was the whole reason for #1244? In other words, this isn't actually true of the current design either?

Also it's worth pointing out that my title refers to "Direct" solvers. I realise that currently Direct and Persistent solvers share a lot of the same code however due to the bugs in the Persistent interface (such as #1244) that are not apparent in the Direct, I think it should be easier to implement this for Direct only for the time being.

michaelbynum · 2020-06-09T16:44:46Z

The key to #1244 is that the bug only occurs if the variables are fixed when the constraints are first added. If the variables are not fixed when the constraints, then you can fix and unfix variables later with the correct behavior.

michaelbynum · 2020-06-09T16:45:42Z

But yes, you make a good point about the distinction between direct and persistent.

blnicho added the pyomo.solvers label Jun 8, 2020

mrmundt mentioned this issue Jan 4, 2024

[PEP] Redesign of Pyomo Solvers #1030

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unused variables are added to Direct solvers #1490

Unused variables are added to Direct solvers #1490

ruaridhw commented Jun 8, 2020

ghackebeil commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 8, 2020 •

edited

Loading

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 9, 2020

michaelbynum commented Jun 9, 2020

michaelbynum commented Jun 9, 2020

Unused variables are added to Direct solvers #1490

Unused variables are added to Direct solvers #1490

Comments

ruaridhw commented Jun 8, 2020

ghackebeil commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 8, 2020 • edited Loading

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

michaelbynum commented Jun 8, 2020

ruaridhw commented Jun 9, 2020

michaelbynum commented Jun 9, 2020

michaelbynum commented Jun 9, 2020

ruaridhw commented Jun 8, 2020 •

edited

Loading