Using dynamo to selected Element ID's via a HTML file

JEAN-LUC_SOLDATIC · March 30, 2016, 1:17am

Hi all,

I am working with a client who is doing Revit coordination reports via copy monitor, this gets outputted as an HMTL file.

Our job is to go and find these objects and make the changes necessary as it is between a central file and multiple linked files, at least 15 different links spanning over 20 floors.

I was wondering if anyone knows how to make an HTML file work for Dynamo that I can extract the element ID from the file itself and then being able to select that element ID from within our central model itself via Dynamo.

Now his reports are coming through with the linked files element ID for the floors and our central file element ID for the floors.

Sorry if this sounds confusing but I think there should be a very straight forward method to doing this rather than having to go and individually go Select by ID every time we have a coordination report.

Kind regards,

Jean-Luc

Andrew_Hannell · April 3, 2016, 6:09pm

Hi Jean-Luc

I’d guess you would have to scrub the data first- html contains a lot of clutter & I don’t believe Dynamo could read it directly.

So there would probably need to be an intermediate step i.e cleaning up the data so it is in a nice tidy format that Dynamo can deal with.
This might be something like the ‘web query’ tool in Excel, or something more sophisticated. If the data is in an html table, it might not be too difficult.

Can post a sample of the html report.

Andrew

Konrad_K_Sobon · April 3, 2016, 7:51pm

Python has a few handful libraries for writing html crawlers. One that I have been using previously is called BeautifulSoup and it gets the job done. Have a look at this post: www.archi-lab.net

This should be exactly what you are looking for. The post is a tad dated, but I bet you can figure it out.

Dimitar_Venkov · April 4, 2016, 12:38am

You could also have a look in Spring nodes’ “ErrorReport.Parse” node. You might be able to adapt it to coordination reports.

Jeff_Shaver · April 4, 2016, 4:21pm

Why not just save the html to text from your browser and read/parse the resulting file from Dynamo?

Gui_Talarico · April 4, 2016, 6:42pm

Python’s Regex should be able to handle this pretty well.

Gui_Talarico · April 4, 2016, 6:44pm

import re

dataEnteringNode = IN
html = IN[0]

exp = r’(?<=id\s)([0-9]*)'
match = re.findall(exp, html)

if match:
OUT = set(match)
else:
OUT = ‘Not Found’

Konrad_K_Sobon · April 4, 2016, 8:49pm

Gui,

I like re! Regular Expression is definitely an option here. The reason i suggested beatiful soup is that it was specifically written with mind on parsing HTML documents while regex is more of a string parsing library. of course an html document is nothing more than a string, but still i thought beautiful soup to be a more elegant solution.

I would only add that to make your suggestion even better, i would just use the os module to read the file in, rather than copy pasting the contents manually. then you can use re to parse it and it will be quite an elegant solution.

Cheers!

Topic		Replies	Views
Selection - Selecting Elements using element IDs Revit	17	3723	June 23, 2021
List all User Selected Elements to Log File Revit dynamo	14	2585	July 18, 2018
Help With Querying Linked Revit Files for Shared Site Information Revit	9	828	August 27, 2018
Select Element from Linked file Revit python , dynamo	10	226	November 24, 2023
Dynamo Player picking element in linked file Revit	7	3744	February 8, 2017

Using dynamo to selected Element ID's via a HTML file

Related Topics