Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[html] Opening up an html table with interspersed headers #1346

Open
anjakefala opened this issue Apr 5, 2022 · 2 comments
Open

[html] Opening up an html table with interspersed headers #1346

anjakefala opened this issue Apr 5, 2022 · 2 comments
Labels

Comments

@anjakefala
Copy link
Collaborator

anjakefala commented Apr 5, 2022

<Notkea> hello, I'm trying to extract data from an HTML table which does not have a header row. I end up with a single column and many empty rows (containing NoneType objects). Any hint of how I could get the data in the cells? The document looks like this: vd "https://webshop.calestor-periway.fr/product/Moniteurs-TV/Moniteurs/Samsung/Samsung-C49J890DKR-cran-LED-incurv-49-?searchtrack=ProductList&prodid=1437755&info=2"

@anjakefala anjakefala added the bug label Apr 5, 2022
@anjakefala
Copy link
Collaborator Author

anjakefala commented Apr 5, 2022

--header 0 does not seem to help. Opening an issue to investigate for when we have more focused time. Question was originally asked on #visidata.

@saulpw
Copy link
Owner

saulpw commented Jan 7, 2023

The table structure is <tr> alternately containing <th>/<td> tags. The html loader will have to do something different in this particular case.

@saulpw saulpw changed the title [html] Opening up an html table with no headers [html] Opening up an html table with interspersed headers Jan 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants