1001 Freelance Projects
Project title: AI specialist for advanced scraping tool for housing websites
Posted by: External project from PeoplePerHour
Started: 17-Dec-2024 12:34 GMT
Description: I am looking for a specialist with extensive experience in AI to develop a Windows Service in C# that does the following:
Every day, visit a list of approximately 800 URLs of real estate agency websites and navigate through the pages to search for newly listed properties added by the agencies.

Next, the service must read these property pages, extract the relevant data, and store it in a fixed format in tables on an SQL server.

A number of data fields are mandatory, such as:

The direct URL of the property page within the real estate agency's website (to enforce uniqueness)
The city where the property is located
The street where the property is located
The property type, where the choice comes from our fixed list: entire home, apartment, studio, etc. The engine must select the closest match from our list
The number of rooms
The monthly rental price
Whether this price includes or excludes service charges
The date the property is available
The surface area in square meters
A list of URLs of the photos associated with the property
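
To make the mandatory fields above concrete, here is a minimal sketch, written in Python for brevity even though the service itself is to be built in C#. The field names, the three-entry `PROPERTY_TYPES` list, and the helper names are assumptions for illustration; the real fixed list would be supplied. Plain string similarity stands in for the closest-match step and would not catch synonyms (for example "flat" versus "apartment"), which is where a language model or synonym table would come in.

```python
import difflib
from dataclasses import dataclass, fields
from datetime import date
from typing import Optional

# Hypothetical subset of the fixed property-type list.
PROPERTY_TYPES = ["entire home", "apartment", "studio"]


def closest_property_type(raw: str, choices=PROPERTY_TYPES) -> Optional[str]:
    """Return the entry of the fixed list most similar to the scraped text."""
    # cutoff=0.0 means some entry is returned even for a weak match.
    matches = difflib.get_close_matches(raw.strip().lower(), choices, n=1, cutoff=0.0)
    return matches[0] if matches else None


@dataclass
class Listing:
    """The mandatory fields, one attribute per field (names are assumed)."""
    url: str                             # direct property URL, used for uniqueness
    city: str
    street: str
    property_type: str                   # one of PROPERTY_TYPES
    rooms: int
    monthly_rent: float
    rent_includes_service_charges: bool
    available_from: date
    surface_m2: float
    photo_urls: list[str]


def missing_mandatory(record: dict) -> list[str]:
    """List the mandatory fields that are absent or empty in a scraped record."""
    return [f.name for f in fields(Listing) if record.get(f.name) in (None, "", [])]
```

A record with a non-empty result from `missing_mandatory` could be queued for review instead of being stored.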

Additionally, there is a list of optional fields we would like to retrieve if the information is available:

Municipality
District
Postal code
House number
Number of bedrooms
Number of bathrooms
Year of construction
Is there a: garden, garage, rooftop terrace, balcony?
Condition of the property
Is the property furnished?
...and so on
A complete list will be provided.
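
The storage requirement could be sketched as follows. This is only an illustration under stated assumptions: SQLite (via Python's standard `sqlite3` module) stands in for the SQL server, the table and column names are invented, and only three of the optional fields are shown. The property URL is the primary key, so a second insert of the same page is rejected; mandatory fields are `NOT NULL` and optional fields are nullable.

```python
import sqlite3

# Assumed schema; the real tables would follow the complete field list.
SCHEMA = """
CREATE TABLE IF NOT EXISTS listing (
    url            TEXT PRIMARY KEY,   -- direct property URL enforces uniqueness
    city           TEXT NOT NULL,
    street         TEXT NOT NULL,
    property_type  TEXT NOT NULL,
    rooms          INTEGER NOT NULL,
    monthly_rent   REAL NOT NULL,
    rent_includes_service_charges INTEGER NOT NULL,  -- 0 or 1
    available_from TEXT NOT NULL,      -- ISO 8601 date
    surface_m2     REAL NOT NULL,
    postal_code    TEXT,               -- optional fields are nullable
    bedrooms       INTEGER,
    furnished      INTEGER
);
CREATE TABLE IF NOT EXISTS listing_photo (
    listing_url TEXT NOT NULL REFERENCES listing(url),
    photo_url   TEXT NOT NULL,
    PRIMARY KEY (listing_url, photo_url)
);
"""

COLUMNS = (
    "url", "city", "street", "property_type", "rooms", "monthly_rent",
    "rent_includes_service_charges", "available_from", "surface_m2",
    "postal_code", "bedrooms", "furnished",
)


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the tables if they do not exist yet."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def save_listing(conn: sqlite3.Connection, row: dict, photos: list) -> bool:
    """Store a new listing; return False if its URL is already known."""
    if conn.execute("SELECT 1 FROM listing WHERE url = ?", (row["url"],)).fetchone():
        return False
    marks = ", ".join("?" * len(COLUMNS))
    values = [row.get(c) for c in COLUMNS]  # missing optional fields become NULL
    with conn:  # commits on success, rolls back if a constraint fails
        conn.execute(f"INSERT INTO listing ({', '.join(COLUMNS)}) VALUES ({marks})", values)
        conn.executemany(
            "INSERT OR IGNORE INTO listing_photo VALUES (?, ?)",
            [(row["url"], p) for p in dict.fromkeys(photos)],
        )
    return True
```

A missing mandatory field raises `sqlite3.IntegrityError` rather than being stored silently.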

The first challenge is that each real estate agency uses a different paging method and different page layouts. Furthermore, some agencies put all the information in one block of text, while others display much of the data in columns. Layouts can also change without notice. Therefore, the software must be resilient and able to work out how to navigate the pages to find new properties.
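
One possible baseline for the paging problem, before any AI model is involved, is a generic "find the next page" heuristic. The sketch below is in Python for illustration and rests on assumptions: it prefers links marked `rel="next"`, falls back to link text such as "Next" or "»" (the `NEXT_TEXT` pattern is an invented example and would need per-language extensions), and takes a hypothetical `fetch` callable that returns the HTML for a URL. Sites that page through JavaScript or infinite scroll would need a different approach.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumed link texts that mean "next page"; extend for other languages.
NEXT_TEXT = re.compile(r"^\s*(next|›|»|>)\s*$", re.IGNORECASE)


class _LinkCollector(HTMLParser):
    """Collect (href, rel, text) for every <a> and rel=next <link> element."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._current = None

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        rel = (a.get("rel") or "").lower()
        if tag == "link" and "next" in rel.split() and a.get("href"):
            self.links.append((a["href"], "next", ""))
        elif tag == "a" and a.get("href"):
            self._current = [a["href"], rel, ""]

    def handle_data(self, data):
        if self._current is not None:
            self._current[2] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self.links.append(tuple(self._current))
            self._current = None


def find_next_page(html: str, base_url: str):
    """Return the absolute URL of the next results page, or None."""
    parser = _LinkCollector()
    parser.feed(html)
    for href, rel, _text in parser.links:      # explicit markup first
        if "next" in rel.split():
            return urljoin(base_url, href)
    for href, _rel, text in parser.links:      # then visible link text
        if NEXT_TEXT.match(text):
            return urljoin(base_url, href)
    return None


def crawl_listing_pages(start_url: str, fetch, max_pages: int = 50):
    """Yield (url, html) for each results page, stopping on loops or the page cap."""
    seen, url = set(), start_url
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = fetch(url)
        yield url, html
        url = find_next_page(html, url)
```

Agencies for which this heuristic finds no next page could then be routed to a slower, model-based navigation step.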

A second challenge is that some agencies include photos of other nearby properties under the details of a specific property. The tool must recognize that these photos do not belong to the property in question and should ignore them.

For cost reasons, we would prefer an AI model that does not rely on a commercial API, unless a commercial API offers benefits significant enough to justify the expense.
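
One way to keep the choice between a local model and a commercial API open is to hide the model behind a single function. The Python sketch below assumes a hypothetical `complete` callable that takes a prompt string and returns the model's text reply; no particular model or API is implied. The `FIELDS` list is an invented subset and the prompt wording is only an example.

```python
import json
from typing import Callable

# Hypothetical subset of the fields to extract.
FIELDS = ["city", "street", "property_type", "rooms", "monthly_rent"]


def build_prompt(page_text: str) -> str:
    """Ask for one JSON object with a fixed set of keys."""
    return (
        "Extract the following fields from the rental listing below and answer "
        "with one JSON object using exactly these keys (use null when a value "
        "is absent): " + ", ".join(FIELDS) + "\n\nListing:\n" + page_text
    )


def extract_fields(page_text: str, complete: Callable[[str], str]) -> dict:
    """Run the model and return a dict with exactly the keys in FIELDS."""
    reply = complete(build_prompt(page_text))
    # Models often wrap the JSON in extra text, so cut out the outermost braces.
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end <= start:
        return {}
    try:
        data = json.loads(reply[start:end + 1])
    except json.JSONDecodeError:
        return {}
    return {key: data.get(key) for key in FIELDS}  # drop unexpected keys
```

Swapping models then means swapping the `complete` callable, which also makes it straightforward to compare a local model against a commercial one on the same sample of pages.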

I would love to hear about your experience and how you would approach this. Specifically: which AI method/engine you would use and the flow of the software.
Project ID: 3413051