issue in cell 776, missing group by of feature columns #4

balalavanya · 2020-09-01T10:41:43Z

    grouped_columns = df.loc[partition].agg(aggregations, squeeze=False)

The text was updated successfully, but these errors were encountered:

balalavanya · 2020-09-01T11:43:49Z

not working for categorical column, error "unhashable type: 'list'"
i tried with features
feature_columns = ['age', 'occupation']

pablomoha · 2021-03-10T17:42:31Z

Hey Nithin, when i run this command:

dfn = build_anonymized_dataset(df, finished_partitions, feature_columns, sensitive_column)

it shows me this error.
can you help me please.
Thank you.

TypeError Traceback (most recent call last)
D:\anaconda\lib\site-packages\pandas\core\series.py in aggregate(self, func, axis, *args, **kwargs)
3960 try:
-> 3961 result = self.apply(func, *args, **kwargs)
3962 except (ValueError, AttributeError, TypeError):

D:\anaconda\lib\site-packages\pandas\core\series.py in apply(self, func, convert_dtype, args, **kwds)
4107 values = self.astype(object)._values
-> 4108 mapped = lib.map_infer(values, f, convert=convert_dtype)
4109

pandas_libs\lib.pyx in pandas._libs.lib.map_infer()

in agg_categorical_column(series)
1 def agg_categorical_column(series):
----> 2 return [','.join(set(series))]
3

TypeError: 'int' object is not iterable

During handling of the above exception, another exception occurred:

TypeError Traceback (most recent call last)
D:\anaconda\lib\site-packages\pandas\core\frame.py in aggregate(self, func, axis, *args, **kwargs)
7574 try:
-> 7575 result, how = self._aggregate(func, axis, *args, **kwargs)
7576 except TypeError as err:

D:\anaconda\lib\site-packages\pandas\core\frame.py in _aggregate(self, arg, axis, *args, **kwargs)
7605 return result, how
-> 7606 return aggregate(self, arg, *args, **kwargs)
7607

D:\anaconda\lib\site-packages\pandas\core\aggregation.py in aggregate(obj, arg, *args, **kwargs)
565 arg = cast(AggFuncTypeDict, arg)
--> 566 return agg_dict_like(obj, arg, _axis), True
567 elif is_list_like(arg):

D:\anaconda\lib\site-packages\pandas\core\aggregation.py in agg_dict_like(obj, arg, _axis)
751 # key used for column selection and output
--> 752 results = {key: obj._gotitem(key, ndim=1).agg(how) for key, how in arg.items()}
753

D:\anaconda\lib\site-packages\pandas\core\aggregation.py in (.0)
751 # key used for column selection and output
--> 752 results = {key: obj._gotitem(key, ndim=1).agg(how) for key, how in arg.items()}
753

D:\anaconda\lib\site-packages\pandas\core\series.py in aggregate(self, func, axis, *args, **kwargs)
3962 except (ValueError, AttributeError, TypeError):
-> 3963 result = func(self, *args, **kwargs)
3964

in agg_categorical_column(series)
1 def agg_categorical_column(series):
----> 2 return [','.join(set(series))]
3

TypeError: sequence item 0: expected str instance, int found

The above exception was the direct cause of the following exception:

TypeError Traceback (most recent call last)
in
----> 1 dfn = build_anonymized_dataset(df, finished_partitions, feature_columns, sensitive_column)

in build_anonymized_dataset(df, partitions, feature_columns, sensitive_column, max_partitions)
12 if max_partitions is not None and i > max_partitions:
13 break
---> 14 grouped_columns = df.loc[partition].agg(aggregations, squeeze=False)
15 sensitive_counts = df.loc[partition].groupby(sensitive_column).agg({sensitive_column : 'count'})
16 values = grouped_columns.iloc[0].to_dict()

D:\anaconda\lib\site-packages\pandas\core\frame.py in aggregate(self, func, axis, *args, **kwargs)
7579 f"incompatible data and dtype: {err}"
7580 )
-> 7581 raise exc from err
7582 if result is None:
7583 return self.apply(func, axis=axis, args=args, **kwargs)

TypeError: DataFrame constructor called with incompatible data and dtype: sequence item 0: expected str instance, int found

Arigato97 · 2021-05-12T05:10:01Z

AttributeError: 'list' object has no attribute 'to_dict' How to solve this mistake, please

glassonion1 · 2021-10-20T10:23:04Z

AttributeError: 'list' object has no attribute 'to_dict' How to solve this mistake, please

I got the same error. this is workaround below.

def agg_categorical_column(series):
    # workearound here
    series.astype('category')
    return [','.join(set(series))]

XDUqinian · 2022-05-30T15:03:57Z

AttributeError: 'list' object has no attribute 'to_dict' How to solve this mistake, please

I got the same error too. It could be found that the df. agg() returned a Series instead of a Dataframe, so I transformed it. this is workaround below.

grouped_columns = df.loc[partition].agg(aggregations, squeeze=False)
sensitive_counts = df.loc[partition].groupby(sensitive_column).agg({sensitive_column : 'count'})
#insert
df2=grouped_columns.to_frame()
grouped_columns=pd.DataFrame(df2.values.T,columns=df2.index)
#insert_end

IamWorld · 2024-03-01T05:27:51Z

I was able to make it work, I had to change the file anonypy.py on the lines 79 and 108

Replacing this line

values = grouped_columns.iloc[0].to_dict()

with this one

values= {}
for name,val in grouped_columns.items():
values[name] = val[0]

Hope it helps.

balalavanya closed this as completed Sep 1, 2020

balalavanya reopened this Sep 1, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue in cell 776, missing group by of feature columns #4

issue in cell 776, missing group by of feature columns #4

balalavanya commented Sep 1, 2020 •

edited

Loading

balalavanya commented Sep 1, 2020

pablomoha commented Mar 10, 2021

Arigato97 commented May 12, 2021

glassonion1 commented Oct 20, 2021 •

edited

Loading

XDUqinian commented May 30, 2022 •

edited

Loading

IamWorld commented Mar 1, 2024

issue in cell 776, missing group by of feature columns #4

issue in cell 776, missing group by of feature columns #4

Comments

balalavanya commented Sep 1, 2020 • edited Loading

balalavanya commented Sep 1, 2020

pablomoha commented Mar 10, 2021

it shows me this error. can you help me please. Thank you.

Arigato97 commented May 12, 2021

glassonion1 commented Oct 20, 2021 • edited Loading

XDUqinian commented May 30, 2022 • edited Loading

IamWorld commented Mar 1, 2024

balalavanya commented Sep 1, 2020 •

edited

Loading

it shows me this error.
can you help me please.
Thank you.

glassonion1 commented Oct 20, 2021 •

edited

Loading

XDUqinian commented May 30, 2022 •

edited

Loading