Who this is for?
Extract & Summarize Yelp Business Review is an automated workflow that extracts the Yelp business reviews using Bright Data Web Unlocker, processes and formats the raw data, summarizes using Google Gemini’s LLM, and forwards the concise summary with the review response to a specified webhook endpoint.
This workflow is tailored for:
– Local SEO Specialists who need structured insights from Yelp reviews to optimize listings.
– Business Owners wanting quick summaries of what customers love or complain about.
– Reputation Managers who monitor brand sentiment and identify customer pain points.
– Data Analysts & Researchers extracting Yelp review patterns at scale.
– AI Product Builders needing clean Yelp review data as input for their LLMs or recommender systems.
What problem is this workflow solving?
Yelp reviews are rich in customer sentiment but messy to work with manually. This workflow solves:
– The pain of scraping Yelp review content manually.
– The challenge of building structured data with the summary.
– The need for structured outputs suitable for analysis, reports, or AI input.
What this workflow does
This automated pipeline does the following:
– Bright Data Integration: Queries Yelp and scrapes business listing data using Bright Data’s Web Unlocker.
– Structured Data Formatting: Formats the Yelp review data to a structured response in JSON format.
– Google Gemini Summarization: Sends the cleaned reviews to Google Gemini to summarize.
– Output Delivery: Returns the structured response with the concise summary over the webhook endpoint.
How to customize this workflow to your needs
This workflow is built to be flexible – whether you’re a market researcher, entrepreneur, or data analyst. Here’s how you can adapt it to fit your specific use case:
– Target Specific Business Categories: Update the Yelp Business Review input to scrape different businesses like gyms, salons, etc.
– Limit Reviews: Add filters by description, location, page range to get the top reviews.
– Tweak the Data Extraction Node: Update the Structured Data Extractor node Output Parser for building the JSON response with the appropriate fields or attributes.
– Tweak the Summarization Prompt: Modify the Gemini prompt to generate a comprehensive summary.
– Send Output to Other Destinations: Replace the Webhook URL to forward output to Google Sheets, Airtable, Slack or Discord, or custom API endpoints.