Batch sampling improvement #1154

dengdifan · 2024-10-23T13:53:01Z

Closing #1152
This is the first step towards solving the batch sampling

TODO:

checking if everything works well under batch setting
adding unit tests

benjamc

Update CHANGELOG.md
Possibly update example
Update docs / somewhere mention this new feature

benjamc · 2024-12-04T14:55:33Z

smac/main/config_selector.py

+                    Y_estimated = self.estimate_running_config_costs(
+                        X_running, Y, self._batch_sampling_estimation_strategy
+                    )
+                    if Y_estimated is not None:


in what cases could this be None?

ah when there are no running configs

benjamc · 2024-12-04T14:56:02Z

smac/main/config_selector.py

+            a np array with size (n_evaluated_configs, n_obj) that records the costs of all the previous evaluated
+            configurations
+        estimation_strategy: str
+            how do we estimate the target y_running values


copy docstring from above to add more info about the estimation strategy here

benjamc · 2024-12-04T14:56:41Z

smac/main/config_selector.py

+            Y_evaluated: np.ndarray,
+            estimation_strategy: str = 'CL_max'):
+        """
+        This function is implemented to estimate the still pending/ running configurations


add newline

benjamc · 2024-12-04T14:57:59Z

smac/main/config_selector.py

+            Y_estimated = np.nanmin(Y_evaluated, axis=0, keepdims=True)
+            return np.repeat(Y_estimated, n_running_points, 0)
+        elif estimation_strategy == 'CL_mean':
+            # constant liar min, we take the mean values of all the evaluated Y and apply them to the running X


should be constant liar mean instead of min

benjamc · 2024-12-04T14:58:48Z

smac/main/config_selector.py

+            # gaussian process
+            assert isinstance(self._model, GaussianProcess), 'Sample based estimate strategy only allows ' \
+                                                             'GP as surrogate model!'
+            return self._model.sample_functions(X_test=X_running, n_funcs=1)


Why cannot we sample from the random forest?

benjamc · 2024-12-04T14:59:50Z

smac/runhistory/encoder/abstract_encoder.py

+                trial: self.runhistory[trial]
+                for trial in self.runhistory
+                if self.runhistory[trial].status == StatusType.RUNNING
+                # and runhistory.data[run].time >= self._algorithm_walltime_limit  # type: ignore


why is this commented out / why would we need this?
If it should stay there commented, please explain why

benjamc · 2024-12-04T15:00:37Z

smac/runhistory/encoder/abstract_encoder.py

+                trial: self.runhistory[trial]
+                for trial in self.runhistory
+                if self.runhistory[trial].status == StatusType.RUNNING
+                # and runhistory.data[run].time >= self._algorithm_walltime_limit  # type: ignore


benjamc · 2024-12-04T15:01:03Z

smac/runhistory/encoder/abstract_encoder.py

@@ -211,6 +234,13 @@ def _get_timeout_trials(

        return trials

+    def _convert_config_ids_to_array(self,
+                                     config_ids: Iterable[int]) -> np.ndarray:
+        """extract the configurations from rh and transform them into np array"""


write proper docstring with Parameters and return values

dengdifan added 2 commits October 23, 2024 15:46

allow encoder to return running configs

9676189

add options for batch sampling

6b86808

benjamc mentioned this pull request Oct 24, 2024

Migrate docs #1155

Merged

maint constant liar with nan values

06719d2

benjamc added this to the v2.3 milestone Nov 27, 2024

benjamc added enhancement feature labels Nov 27, 2024

Merge branch 'development' into batch_sampling_improvement

8b748db

benjamc assigned dengdifan Dec 2, 2024

dengdifan added 2 commits December 2, 2024 15:46

add docs

2487bc3

tests for config selectors

d3f4f11

dengdifan requested a review from benjamc December 2, 2024 14:50

benjamc requested changes Dec 4, 2024

View reviewed changes

benjamc linked an issue Dec 4, 2024 that may be closed by this pull request

Improve batch sampling #1152

Open

solve conflict

3c2196a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch sampling improvement #1154

Batch sampling improvement #1154

dengdifan commented Oct 23, 2024

benjamc left a comment

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

benjamc Dec 4, 2024

Batch sampling improvement #1154

Are you sure you want to change the base?

Batch sampling improvement #1154

Conversation

dengdifan commented Oct 23, 2024

benjamc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment