Laws of Bangladesh and AI search(AutoRAG) by Cloudflare

Sayed | সাঈদ

27 Oct 2025 — 2 min read

Photo by Tingey Injury Law Firm / Unsplash

Laws of Bangladesh are available at the government website here: http://bdlaws.minlaw.gov.bd/laws-of-bangladesh.html

Goals:

Make the law content to be able to feed into AI model and train them.
Or, make RAG(Retrieval-Augmented Generation) to answer question from the data source(in this case, Laws of Bangladesh)

What I did:

I found that someone already made few models here, here and here. And, a Kaggle dataset. I could use those but I didn't. I scraped the whole site: http://bdlaws.minlaw.gov.bd/laws-of-bangladesh.html
Using the scraped data I built a website using Mintilfy CMS(Documentation Management software). I could use opensource documentation software like Docurus, mkDocs etc. But, I choosed to use Mintlify as it's free plan was enough for me to host the Laws. Also, Mintlify offers custom domain for free. I didn't take extra hassle to host them.
Mintlify offer AI search using RAG but it's in their paid plan.
Then, I discovered AI search (AutoRAG )by Cloudflare. It crawls all the content using the sitemap: https://laws.sayed.app/sitemap.xml and save them in R2 bucket.

Cloudflare AI search download every page and save them in a R2 bucket and make vector db from these pages. Which is not ideal and efficient for https://laws.sayed.app
I used @cf/baai/bge-m3 model which was default. Model Documentation here. But, this model is not efficient and provide wrong answer! For test purposes, I used this model.
All the Laws are also available in markdwon format from mintlify as llms.txt , llms-full.txt and Github repo. The llms.txt and llms-full.txt can be used as source for RAG. I may implement it on cloudflare later.

Here some stats from cloudflare:

Idk, if these traffic are real or not but many AI crawler do crawl the site. Beside my root zone(sayed.app) have few subdomain so, traffic from other site may count.

Thanks for reading.

Note: This article was not written by any AI and this article may or may not help anyone.

ট্র্যাকারবিডিঃ কী এবং কেন?

ট্র্যাকার.বিডি কী? ট্র্যাকার.বিডি হলো বাংলাদেশের জন্য তৈরি একটি বিনামূল্যের ওয়েব টুলস প্ল্যাটফর্ম। এখানে একাধিক কাজ একটাই

QnA about the Personal Data Protection Ordinance, 2025 (ব্যক্তিগত উপাত্ত সুরক্ষা অধ্যাদেশ, ২০২৫)

You all know, Government of Bangladesh approved the the Personal Data Protection Ordinance, 2025 which define/provide information about how any entity will use/handle/process personal data. Here is the pdf link. 1. Can I delete my account from any Bangladeshi website/business/service/company? Yes. The Ordinance grants

Comprehensive Risk Assessment of Rooppur Nuclear Power Plant: Water Security and Environmental Impact Analysis on the Padma River

Cover Credit: https://www.atomic-energy.ru/

Vertical Extension of Civil Engineering Building

Abstract This thesis explores the structural design and feasibility of a vertical expansion project involving an existing three-story Reinforced Cement Concrete (RCC) building constructed in 1968, currently in continuous use by a Civil Engineering de...

Read more

ট্র্যাকারবিডিঃ কী এবং কেন?

QnA about the Personal Data Protection Ordinance, 2025 (ব্যক্তিগত উপাত্ত সুরক্ষা অধ্যাদেশ, ২০২৫)

Comprehensive Risk Assessment of Rooppur Nuclear Power Plant: Water Security and Environmental Impact Analysis on the Padma River

Vertical Extension of Civil Engineering Building