Recently, we’ve spent some time talking about the challenges of working with product data on the internet. Jatin Kakkar, our VP of Product Management, has given a webinar and written three blog posts outlining the challenges we’ve seen with dirty data.
To put it in Jatin’s words:
“Indix collates product data from many sources like various web properties, data feeds acquired from retailers, brands, and other partners. This huge load of data goes into the Indix spin cycle which organizes the data around key pivots like a standardized brand dictionary, and a common category taxonomy across the sources. Next, it is refined to drop data which does not meet strict quality guidelines. Finally, this data is structured to a simple to understand JSON format and indexed.”
Now, we’ll dig into ingestion and organization and talk about the underlying technology and architecture behind the way Indix does it in our upcoming webinar on “Ingesting and Structuring Product Data from the Web.”
View our webinar – “Ingesting and Structuring Product Data from the Web.”
Also published on Medium.