how to use? #7

RohitNegi12 · 2023-11-13T15:14:02Z

The lzma_decompress script decompresses the data into a different format with a .p extension. How can I convert this .p file into a standard format such as CSV?

gokdumano · 2024-02-23T13:41:16Z

You can use this function to turn *.lzma files into records (dictionaries), then save them in any format you like. @RohitNegi12

from collections import defaultdict
from typing import Iterator

import compress_pickle

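def lzma2records(fpath: str) -> Iterator[dict]:
    # Each loaded item is unpacked as (values, fields); the code assumes
    # `values` is a whitespace-separated string and `fields` gives a field
    # name for each token.
    for values, fields in compress_pickle.load(fpath, compression='lzma'):
        dd = defaultdict(list)
        # Group the tokens by their field name
        for value, key in zip(values.split(), fields):
            dd[key].append(value)
        record = {key: ' '.join(tokens) for key, tokens in dd.items()}
        # Keep the original, unsplit text as well
        record['FullText'] = values
        yield record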
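
Since the original question was about CSV, here is a minimal sketch of one way to save such records with the standard-library csv module. The helper name records_to_csv and the file names in the usage comment are assumptions for illustration, not part of this repository. It takes any iterable of dictionaries, so the generator above should work as input, and it fills missing fields with empty strings because records may not all share the same keys.

```python
import csv
from typing import Iterable


def records_to_csv(records: Iterable[dict], out_path: str) -> int:
    """Write dictionaries to a CSV file and return the number of rows written.

    Hypothetical helper, not part of the repository.
    """
    # Materialize the records so the full set of column names is known
    # before the header is written.
    rows = list(records)

    # Collect column names in first-seen order across all records.
    fieldnames = []
    for row in rows:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)

    with open(out_path, "w", newline="", encoding="utf-8") as fh:
        # restval fills in fields that a given record does not have.
        writer = csv.DictWriter(fh, fieldnames=fieldnames, restval="")
        writer.writeheader()
        writer.writerows(rows)

    return len(rows)


# Example usage (file names are placeholders):
# records_to_csv(lzma2records("data.lzma"), "data.csv")
```

This holds every record in memory before writing, which keeps the header complete but may be too heavy for very large files. If the field names are known in advance, passing them directly to csv.DictWriter and writing row by row would avoid that.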
