Data Extraction Made Easy: Using AI to Populate Missing Company Information

Have you ever been faced with a mountain of data, only to realize that some of the fields are missing valuable information? If so, you’re not alone. As marketers, we rely heavily on data to make informed decisions, but sometimes that data can be incomplete or inaccurate. Fortunately, there is a solution: artificial intelligence.

Leveraging AI for Data Enrichment with ChatGPT

Recently, a client approached us at Ivenium Marketing with a problem that’s all too familiar: missing information on their list of companies. They had the basics, but essential fields like industry, revenue, and headcount were empty. In the past, filling these gaps would have required hours of tedious manual work or a pricey investment in data append services. Instead, we decided to put AI to the test.

Using some AI wizardry, we automated the entire process, providing our client with the missing data in record time. While the results weren’t 100% perfect, they were still incredibly helpful. Who knew our machines could play detective and dig up such useful information?

The Power of AI in Data Enrichment

If you find yourself in a similar situation, don’t spend your valuable time on laborious data-entry tasks. Let us show you how AI can turn a headache into a breeze and even provide you with some entertainment along the way. And the best part? You can achieve impressive results for a fraction of the cost of traditional data append services.

We provided ChatGPT with a clear prompt to fill in missing information in our dataset. Specifically, we had several CSV files containing values for Industry, Sub-Industry, Company Size, and Company Revenue.

Prompt: I need you to fill in missing information in our dataset. Specifically, we have several CSV files containing values for Industry, Sub-Industry, Company Size, and Company Revenue. Please use the files to fill in missing values for these columns in our dataset. 

As for the Website column, we don’t have specific instructions for how to populate it, so please use your best judgement and skills to fill in this information. 

I’m going to load the CSV files now so you can get started. 

ChatGPT: Sure, I’m ready to assist you with filling in the missing information in your dataset. Please go ahead and load the CSV files, and I’ll get started on processing the data to fill in the missing values. 

Prompt: Can I upload the files here? 

ChatGPT: Sure, here’s a summary of the possible ways a user can share CSV files with ChatGPT: 

1. Copy and paste the contents of each file into our chat window if the CSV files are small enough*

*Copy and paste these CSV files into the chat window like it's 1999! Sure, it might take a bit longer, but ChatGPT will appreciate the personal touch. Plus, it's the best way to ensure your data doesn't end up floating in cyberspace like an astronaut's sock.

2. Email the CSV files to yourself and then provide the information needed to access the files to ChatGPT*

*Well… don't give ChatGPT access to your email unless you want it to start sending emails to all your contacts with subject lines like 'I am a robot, beep bop!' and 'Greetings, fellow humans!'. Trust us, your reputation will never recover.

3. Upload the CSV files to a cloud storage service such as Google Drive, Dropbox, or OneDrive and then share the link to the file with ChatGPT* 

*Warning! If you give ChatGPT a link to a publicly accessible file, it might stare at it blankly, scratching its virtual head, and wondering why the file does not magically appear in its chat window. Remember, ChatGPT can't upload files to the cloud and links can be a tricky business. So, let's stick to plain old copy-pasting, shall we?
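Since copy-pasting is the practical route, a larger CSV can be split into paste-sized pieces first. The snippet below is only a minimal sketch of that idea, not the process used for this article: the function name `chunk_csv` and the default of 10 rows per chunk are illustrative assumptions, and the right chunk size depends on how much text the chat window accepts.

```python
import csv
import io


def chunk_csv(csv_text, rows_per_chunk=10):
    """Split CSV text into smaller CSV strings, repeating the header in each.

    Hypothetical helper for illustration; rows_per_chunk is an assumed value.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return []
    header, data = rows[0], rows[1:]
    chunks = []
    for start in range(0, len(data), rows_per_chunk):
        buf = io.StringIO()
        writer = csv.writer(buf, lineterminator="\n")
        writer.writerow(header)  # every chunk carries its own header row
        writer.writerows(data[start:start + rows_per_chunk])
        chunks.append(buf.getvalue())
    return chunks


# Made-up sample data: three companies, two rows per chunk
sample = "Company,Industry\nAcme,Technology\nGlobex,\nInitech,Retail\n"
for part in chunk_csv(sample, rows_per_chunk=2):
    print(part)
```

Repeating the header in every chunk means each paste can be understood on its own, even if the chat loses track of earlier messages.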

Check out how ChatGPT improved our dataset accuracy from 42% to 91%!

For the purposes of this article, we limited the sample size for this AI-powered data enrichment to just 30 companies to stay within the query limit. We chose to use publicly available data from Wikipedia to build our dataset of companies and their corresponding industry, sub-industry, company size, company revenue, and website.

It's quite possible that ChatGPT's performance will vary with larger datasets or different types of data. But based on what we've seen so far, we're pretty impressed with its capabilities. Ready to see how ChatGPT performs when put to the test? Buckle up, because the results may surprise you.

Our initial dataset had a fill rate of only 42%, meaning that more than half of the cells were empty. The existing field values were accurate, but we counted every missing cell as invalid, so accuracy also started at 42%. We provided ChatGPT with standard picklist values for industry, sub-industry, company size, and company revenue, which it used to fill in the gaps.

After ChatGPT filled in the gaps, the dataset’s fill rate jumped to 100%. Accuracy, however, came in at 91%, because the response included some errors and non-standard values. Even so, the improvement was substantial and saved considerable time and effort compared with filling in missing data manually.
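For anyone who wants to run the same checks on their own list, the arithmetic is simple: fill rate is the number of non-empty cells divided by the total number of cells, and non-standard values are any filled cells that fall outside the picklist. Here is a rough sketch under stated assumptions: the function names, column names, picklist values, and sample companies are all made up for illustration, and the script only catches non-standard values. Factual errors (a valid picklist value assigned to the wrong company) still need a manual spot check.

```python
def fill_rate(rows, columns):
    """Return the share of cells in `columns` that are non-empty."""
    total = len(rows) * len(columns)
    filled = sum(
        1 for row in rows for col in columns if (row.get(col) or "").strip()
    )
    return filled / total if total else 0.0


def picklist_violations(rows, picklists):
    """List (row_index, column, value) for filled cells outside the picklist."""
    problems = []
    for i, row in enumerate(rows):
        for col, allowed in picklists.items():
            value = (row.get(col) or "").strip()
            if value and value not in allowed:
                problems.append((i, col, value))
    return problems


# Hypothetical picklists and rows, not the data from this article
picklists = {
    "Industry": {"Technology", "Retail"},
    "Company Size": {"1-50", "51-200"},
}
before = [
    {"Company": "Acme", "Industry": "Technology", "Company Size": ""},
    {"Company": "Globex", "Industry": "", "Company Size": "51-200"},
]
after = [
    {"Company": "Acme", "Industry": "Technology", "Company Size": "1-50"},
    {"Company": "Globex", "Industry": "Tech", "Company Size": "51-200"},
]

print(fill_rate(before, list(picklists)))    # share of filled cells before
print(fill_rate(after, list(picklists)))     # share of filled cells after
print(picklist_violations(after, picklists)) # cells to review by hand
```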


Bonus & Summary

Here’s what we weren’t expecting: when asked to fill in missing information, ChatGPT went beyond the provided CSV files and generated a new set of companies that weren’t in the original dataset. The next thing you know, ChatGPT is going to ask for a raise and take over our jobs. Just kidding, we’re pretty sure ChatGPT doesn’t need a raise or a job. It’s just happy to help us mere mortals with our data woes…
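If you'd rather not be surprised by bonus companies, one option is to compare the names in the response against your original list before merging anything. The snippet below is a minimal sketch of that check, assuming the company name lives in a column called `Company`; the function name `split_known_and_new` and the sample names are hypothetical.

```python
def split_known_and_new(original_names, enriched_rows, key="Company"):
    """Separate enriched rows into ones from the original list and new ones.

    Names are compared case-insensitively after trimming whitespace.
    """
    known_keys = {name.strip().casefold() for name in original_names}
    known, new = [], []
    for row in enriched_rows:
        name = (row.get(key) or "").strip().casefold()
        (known if name in known_keys else new).append(row)
    return known, new


# Made-up example: "Initech" was never in the original list
original = ["Acme", "Globex"]
response = [
    {"Company": "acme ", "Industry": "Technology"},
    {"Company": "Globex", "Industry": "Retail"},
    {"Company": "Initech", "Industry": "Technology"},
]
known, new = split_known_and_new(original, response)
print([row["Company"] for row in new])
```

Rows that land in `new` can then be reviewed or dropped instead of slipping quietly into the enriched file.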

Well, that’s a wrap on our exploration of data enrichment with ChatGPT. We hope you found this post as enlightening as we did. At Ivenium Marketing, we’re always on the lookout for innovative tools to make our lives easier and more fun. With ChatGPT and other AI tools, we’re excited about the new opportunities and insights we can uncover.

We’d love to hear your thoughts on this post. Have you used ChatGPT or other AI tools in marketing automation? Share your experiences with us!

For more insights on AI and marketing automation, check out our related and upcoming blog posts.
