Skip to main content
Filter:
Reddit Data Pipeline Architecture
Web ScrapingData Engineering

Reddit Data Collector

Boston University needed large-scale Reddit data for a research project. DataPrism built an optimized pipeline to collect, clean, de-duplicate, and store subreddit, post, and moderator data in BigQuery.

Facebook Data Pipeline Architecture
Data EngineeringArtificial Intelligence

Facebook Data Pipeline using ChatGPT (for Knok’d)

Knok’d needed Facebook group data for its real estate listings platform. DataPrism built a Python and ChatGPT-powered pipeline to extract, clean, transform, and deliver the data in a structured format.

Book Consultation
Book Consultation