首页 / 实操指南 / ai-agent-data-classification

ai-agent-data-classification

综合教程 建议阅读时间: 5-8 分钟

16 de mayo de 2025 | 7 minutos

Cómo automatizar la clasificación de datos y los flujos de trabajo con IA paso a paso

John Hu, socio de Make con 20 años de experiencia en compras, muestra cómo clasificar datos desordenados utilizando un enfoque único de IA. Tomando las clasificaciones W-9 y NAICS como ejemplo, usar Make junto con IA ahorra horas durante el proceso de incorporación. ![AI Classification](__CODE_BLOCK_0__ The familiar proverb about solving big, unwieldy problems goes "How do you eat an elephant? One bite at a time." But as John Hu, President of [Axanexa](__CODE_BLOCK_1__ demonstrated in [one of Make's recent webinars](__CODE_BLOCK_2__ taking things bite-by-bite is also one of the best ways to ensure that AI gives you consistent, reliable answers. These accurate and dependable outputs can then be incorporated into broader workflows to automate away problems like classifying freeform information and data entry – problems that, until now, have required human input. Let's dive into what John shared about how to get the most from your AI, your Make AI Agents, and your scenarios overall though a step-by-step approach to prompting and problem-solving.

The problem: Transferring data from forms or flowcharts to your CRM

Procurement is more than just finding the right supplier for the job: once you've found the right contractor or business to partner with, the paperwork starts. You need to keep meticulous records to ensure that the right forms are issued, the right taxes are paid, and the right information gets to the right people in your organization and externally. If you're doing business in the United States, onboarding an individual contractor means requesting their [W-9 form](__CODE_BLOCK_3__ Partnering with a company means, among many other things, assigning a five- or six-digit [NAICS code](__CODE_BLOCK_4__ that classifies them into the proper line of business. Failing to do either properly means risking fines or legal consequences in the case of an audit, missing out on tax benefits, and/or being ineligible for lucrative government contracts. ![NAICS table](__CODE_BLOCK_5__ Unfortunately, most automated systems aren't adept at correctly identifying whether a form is a valid W-9 at all, much less which fields contain highly variable information such as names and tax identification numbers. And finding the right NAICS code is a matter of both knowing the business and how to navigate a multi-tiered flow chart with hundreds of possible outcomes. In both cases, transferring information from the contractor or company to your system has largely been a matter of manual data entry – a tedious task, but one that still requires the critical thinking and precision of slow and, let's face it, costly human workers. Attempts to cut this manual step out and speed up this portion of the onboarding process have been around for decades and have been fruitless.

"I've been in procurement for 20 years of my career, and I can tell you that this is a common problem, consistent problem across the board for the last 20 years." – John Hu

Luckily, AI can now automate your data entry and classification workflows. Here's how.

The tools: CRM, ChatGPT, and Make

As John showed, this seemingly complex task only needs a few basic tools at hand. He demonstrated using [Monday](__CODE_BLOCK_6__ CRM as his core app and [OpenAI's ChatGPT](__CODE_BLOCK_7__ as his AI, linking them together in Make. You'll need accounts to use all of these apps, but don't forget that you can [get started with Make](__CODE_BLOCK_8__ for free. If you're using a different CRM or AI, not to worry: Make supports over 2,000 apps. If this is the case, the specifics may differ slightly – especially when it comes to [fine-tuning your prompts](__CODE_BLOCK_9__ – but the general outline and certainly the tips below should still hold true.

The setup

Extracting W-9 info with AI

For this portion of the demo, John set up a relatively straightforward Make scenario. In just a few modules, he was able to take an image that had been uploaded to his Monday instance, feed it though an LLM AI for analysis, and send the results of that analysis back to his CRM to be acted upon further. ![Scenario](__CODE_BLOCK_10__ The crucial part of this workflow comes smack dab in the center, where the ChatGPT module makes its first appearance. By mapping one bundle – the image of a W-9 – from the preceding Monday modules onto the ChatGPT module and providing a one-sentence instruction to the AI, John was able to make this whole scenario work. So let's take a look at his prompt, cleaned up slightly:

Please respond ONLY in JSON { w9:"Yes or no}", name,tax_id,error:} if w9 being Yes if it's a valid W-9 and No if it's not

This is enough to provide an answer that can be mapped onto a [JSON module](__CODE_BLOCK_11__ which can then move on to the final step of updating Monday with the results. If it's a valid W-9 form, this final step will mean updating the Monday fields that correspond to the JSON outputs, namely filling in the contractor's name, tax ID number, and making a visual indicator that this form is cleared and ready to use. Impressively, this workflow can even work if an inappropriate form is uploaded. Using the same scenario and the same prompt, John's ChatGPT was able to correctly identify that it was not a valid tax form. While optional, he also added a final step that could provide valuable added information or just a bit of fun. By adding a second AI module filtered to run only when an invalid image was present, he was able to automatically find what _was_ in the image – in this case a snapshot of his daughter – and upload that info – in this case using the linguistic stylings of William Shatner – to Monday.

Classifying info with AI

But John wasn't done yet, and moved on to the matter of assigning a NAICS code to a fine-dining restaurant. The idea here was similar, but completing it successfully required not only drawing on a handful of external data sources but also navigating those sources in bite-sized sub-steps. But all along the way, the crucial component was ChatGPT's AI capabilities. The initial step took place in Monday. By first importing a list of all 600 or more valid NAICS codes at every step in the hierarchy – where 11 is the top-level category of Agriculture businesses, while 111 narrows that down to Crop production, 1111 narrows that to Oilseed and grain farming, and so on – John first made available all the possible codes that the AI could produce by the end of the scenario. He then began building his Make scenario by instructing an AI module to gather and summarize online information about a sample business – the fine-dining restaurant Alinea – based on a few basic pieces of information. Seemingly, it would then be straightforward to simply match this business summary to the right NAICS code. This, however, isn't the way the AI sees it. As John explains: "If you look at Alinea as a whole, if you look at the NAICS code as a whole, it could be in like 10 places and it's inconsistent. If you ask someone to choose 1 out of 10 which one it could be, well it could be any one of them. But if I say choose only 1 out of 3, there's a much better likelihood [of choosing the right answer] in choosing 1 out of 3 than 1 out of 10." This way of thinking informed the remainder of his multi-route scenario. John first instructed his AI to use its critical thinking skills to match the business summary to 1 of the 25 _top-level_ NAICS codes – more than an order of magnitude less than guessing from all of the hundreds of codes. This top-level code, 72, was funneled back to Monday, which his Make scenario drew on once again. Using a similar series of modules but with the prerequisite that the next code _must_ be within the top-level code, he again asked his AI to choose from just a handful of options, producing the code 722. "You can see it forking down the path of what it should be," in John's words. ![Information panel](__CODE_BLOCK_12__ After a few loops, his CRM was automatically updated with the specific – and accurate – NAICS code of 722511 for full-service restaurants.

How AI saves time, costs, and frustration

John's two scenarios straddled the line between deterministic and [non-deterministic automation](__CODE_BLOCK_13__ to accomplish things that simply wouldn't be possible without AI. In his W-9 example, ChatGPT used image recognition, optical character recognition (OCR), y dominio de JSON para reconocer, analizar e informar sobre el contenido de una imagen en _un solo paso_. No solo los intentos previos a la IA de automatizar esta tarea requerirían varios pasos en múltiples programas que probablemente arrojarían resultados en distintos formatos, sino que además lo harían con una precisión dudosa que habría que comprobar dos veces y ensamblar manualmente con intervención humana. Probablemente, llegado a ese punto, no merecería la pena intentar evitar la intervención humana en absoluto. Pero la IA hizo todo esto trabajando sobre una sola frase en lenguaje natural, del mismo modo que podrías formar a una persona de nivel inicial para la entrada de datos en la oficina. Asignar un código NAICS fue similar. De forma parecida a formar a un empleado de oficina administrativa en los códigos NAICS, que cambian constantemente —esencialmente un enorme diagrama de flujo—, y luego indicarle que investigue cada contratista que llegue, ChatGPT fue capaz de encontrar, sintetizar y cotejar datos cuando se le proporcionaron las fuentes de datos correctas. Las únicas diferencias eran que ChatGPT lo hizo al instante, con una menor probabilidad de errores y sin aburrirse ni preocuparse por su satisfacción laboral.

"Esto ahorrará cantidades indecentes de horas, dolores de cabeza, dinero, procesamiento…Este diminuto caso de uso…ahorrará el equivalente a quizá media hora de trabajo." – John Hu

Pon freno a las dificultades de la entrada de datos y la clasificación

Cuando se dispone de los datos adecuados, las instrucciones adecuadas y la mentalidad adecuada, AI automation) puede ser una forma potente de agilizar tareas importantes —aunque complejas y tediosas— como el registro de datos de empleo. Si estás listo para hacer más eficiente y rentable tu área administrativa con IA en Make, regístrate para obtener una cuenta gratuita o reserva una demo hoy mismo.

⭐ 核心工具推荐

执行本指南:你需要 Make.com 账户

本指南中的自动化流程均基于 Make 平台构建。注册免费账户即可直接复刻我们测试过的场景。

  • 即刻获得 1,000 免费额度
  • 完全还原文章中的可视化体验
  • 通过右侧按钮可直接绑定官网注册
  • 无需懂代码即可跑通流程
★★★★★
4.6/5 (G2 评分, 2,400+ 评价)
* 通过上述链接注册不产生额外费用
/>