README.md |
Montreal Building Data
Contributors
Kartikay Sharma (kartikay.sharma@concordia.ca)
Koa Wells (kekoa.wells@concordia.ca)
Dataset Description
The following dataset contains the collection of buildings in the geojson format for the city of Montreal. The dataset is created by combining multiple open-source datasets provided from the Canadian federal government and the Montreal city government. The dataset is split into several different files. The first file, name contains the entire dataset for the buildings of Montreal. The remainig files contain the buildings sorted by admistrative boundary/district.
Data Sources
- Montreal Property Assesment Units
- Montreal 3D Buildings (LOD2 model with textures)
- Montreal Aerial Lidar Dataset
- NRCAN Building Footprints
- Administrative boundaries of the agglomeration of Montréal (boroughs and related cities)
List of dataset changes
- Removed redundant building footprint vertices
- Removed duplicate buildings
Known issues with dataset
Dataset Information
This dataset contains 3d building in multipatch format for 6 teritorries of Montreal city with their respecitve construction year and building use type. The Administrative Boundaries are:
- Anjou:
- Pointe-Claire:
- Rosemont-La Petite-Patrie:
- Kirkland:
- Westmount:
- Hampstead:
- Mercier-Hochelaga-Maisonneuve:
- Senneville:
- Le Sud-Ouest:
- Rivière-des-Prairies-Pointe-aux-Trembles:
- Sainte-Anne-de-Bellevue:
- Le Plateau-Mont-Royal:
- Verdun:
- Dollard-Des Ormeaux:
- Montréal-Est:
- Baie-D'Urfé:
- Lachine:
- Côte-des-Neiges-Notre-Dame-de-Grâce:
- Villeray-Saint-Michel-Parc-Extension:
- L'Île-Dorval:
- Côte-Saint-Luc:
- Beaconsfield:
- Pierrefonds-Roxboro:
- Montréal-Nord:
- Mont-Royal:
- Montréal-Ouest:
- Ahuntsic-Cartierville:
- Saint-Léonard:
- Outremont:
- Ville-Marie:
- L'Île-Bizard-Sainte-Geneviève:
- Dorval:
- Saint-Laurent:
- LaSalle:
Côte-des-Neiges–Notre-Dame-de-Grâce: 20100 buildings Outremont:3529 buildings Le Plateau-Mont-Royal:14326 buildings Le Sud-Ouest: 9926 buildings Verdun: 9001 buildings Ville-Marie: 8371 buildings
Meta data
- "ID_UEV": unique building identifier
- "ANNEE_CONS": year of construction
- "CODE_UTILI": building usage type
- "ADMIN_BOUNDARY": adminstrative boundary the building belongs to
Yes, these file also contains meta data information available for understanding what different attributes means. the same can be accessed through:
- For 3D building multipatches: donnees.montreal.ca/ville-de-montreal/batiment-3d-2016-maquette-citygml-lod2-avec-textures2#territories
- For building use, and other semantic data: https://donnees.montreal.ca/ville-de-montreal/unites-evaluation-fonciere
If No, whether the user created one? Or is there any source to understand the data (if the info is available)
Any psuedonames are used to represent the data attribites? If so, info is available explaining this psuedonames are available?
Data available (mandatory)
Not applicable
For example:
Time stamp: Unix/Epoch time. Date format: MM/DD/YYYY HH:MM or YYYY-MM-DD HH:MM:SS or..... (This is an important info for processing the dataset later)
Weather related data - Solar radiaition (Unit), Outdoor Temperature (Unit), Wind Speed (Unit), etc.,... (If possible information on whether weather parameters are measured from a local weather location or from indvidual homes could be helpful)
Energy related data - For example Plug load (Unit), Lighting Load (Unit), HVAC (Unit). Please provide whether the data available is the total energy data or sub-metered data
Gas measurement data
Hot water data, etc....
Missing values (mandatory)
Yes, these dataset contiains many missing values which are currently empty cells, but work is going on to either remove the empty columns or change them with NA
Missing value format in the dataset: 'Na' or 'NaN' or '-99' or any other formats
Continously missing values or sparsely missing values or No missing values
Data Cleaning (If it is processed dataset, it is mandatory to provide this info)
Whether missing values are processed? If so, what is the technique (Moving average, any machine learning techniques, etc...)
Any date/data is excluded because of continously missing values?
Data processing (If it is processed dataset, it is mandatory to provide this info)
Any information related to data aggregation (example: minutes to hour), data transformation (example: min-max, Z-transformation), outlier detection (example: inter-quartile range, etc..)
Supporting data (optional)
Any other information that can be useful to the new users. For example
There is one other zip file with supporting information in. This includes:
- documents that detail measurements and data visualisation (one per house)
- appliance details, images and inventory
- document that describes the data processing methods and a zip file of the Octave functions used
- document that details the measurement device calibration
- floor plans, dimensions and PIR/DOR locations
- house construction
- house occupant details
- installation data and meter readings
- a spreadsheet that details all measurements taken on the project (used to create the device ID in the header files)
Alert/Caution (Mandatory)
Do's & Dont' of the dataset
Any inherent problem associated with the data, etc...