Name: Web Search
Author: JansenAnalytics

Web Search & Fetch

Search via DuckDuckGo (no API key needed)

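# HTML lite search (replace QUERY; --data-urlencode percent-encodes spaces and symbols)
# grep -P needs a grep built with PCRE support, e.g. GNU grep
curl -s -G --data-urlencode "q=QUERY" "https://html.duckduckgo.com/html/" | grep -oP '(?<=<a rel="nofollow" class="result__a" href=").*?(?=")'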
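
The grep pattern above only matches when the attributes appear in exactly that order. As a rough alternative, the sketch below parses the same result page with Python's standard library and matches on the result__a class alone. The names build_search_url, ResultLinks, unwrap and extract_links are hypothetical helpers, and the handling of a uddg redirect parameter is an assumption about how DuckDuckGo wraps result links, not something stated in this document.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qs, quote_plus, urlsplit

SEARCH_ENDPOINT = "https://html.duckduckgo.com/html/"  # endpoint from the command above


def build_search_url(query: str) -> str:
    """Percent-encode the query so spaces and '&' survive in the URL."""
    return SEARCH_ENDPOINT + "?q=" + quote_plus(query)


def unwrap(href: str) -> str:
    """Assumption: redirect links carry the real target in a 'uddg' parameter."""
    target = parse_qs(urlsplit(href).query).get("uddg")
    return target[0] if target else href


class ResultLinks(HTMLParser):
    """Collect href values of <a> tags whose class list contains result__a."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if tag == "a" and "result__a" in classes and attr.get("href"):
            self.links.append(unwrap(attr["href"]))


def extract_links(page: str) -> list:
    """Return result links found in an already downloaded results page."""
    parser = ResultLinks()
    parser.feed(page)
    return parser.links
```

The page itself can still be downloaded with curl as shown above and piped in; extract_links only needs the HTML as a string.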

Fetch URL content

# Readable text extraction (prints the first 5000 characters; -L follows redirects)
curl -sL URL | python3 -c "
import sys
from html.parser import HTMLParser
SKIP = ('script', 'style', 'nav', 'header', 'footer')
class T(HTMLParser):
    def __init__(self):
        super().__init__()
        self.text = []
        self.depth = 0  # number of skipped elements currently open
    def handle_starttag(self, tag, attrs):
        if tag in SKIP: self.depth += 1
    def handle_endtag(self, tag):
        # a counter (not a flag) keeps skipping after a nested nav closes inside a header
        if tag in SKIP and self.depth: self.depth -= 1
    def handle_data(self, data):
        if not self.depth: self.text.append(data.strip())
t = T()
t.feed(sys.stdin.read())
print('\n'.join(filter(None, t.text))[:5000])
"
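
To see what the filter keeps and drops, the parser can be wrapped in a function and run on an inline sample. This is a sketch only: TextExtractor and extract_text are hypothetical names, the sample HTML is made up, and the depth counter is a choice made here so that a nav nested inside a header does not end the skipping early.

```python
from html.parser import HTMLParser

SKIPPED = {"script", "style", "nav", "header", "footer"}  # same tags as the one-liner


class TextExtractor(HTMLParser):
    def __init__(self):
        super().__init__()
        self.chunks = []
        self.depth = 0  # how many skipped elements are currently open

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        # keep only non-empty text that sits outside every skipped element
        if self.depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_text(page: str, limit: int = 5000) -> str:
    """Return visible text, one chunk per line, cut to `limit` characters."""
    parser = TextExtractor()
    parser.feed(page)
    parser.close()
    return "\n".join(parser.chunks)[:limit]


sample = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<header><nav>Home</nav> Site title</header>"
    "<p>First paragraph.</p><script>var x = 1;</script>"
    "<p>Second &amp; last.</p><footer>Contact</footer></body></html>"
)
print(extract_text(sample))
```

On this sample only the two paragraph texts are printed; the style, header, nav, script and footer contents are dropped.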

Alternative: lynx (if installed)
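
One possible way to use lynx for this, sketched under assumptions: lynx's -dump option renders a page as plain text on stdout and -nolist leaves out the trailing list of links. The helper names lynx_command and fetch_with_lynx, the 5000-character limit (mirroring the extractor above) and the 30-second timeout are illustrative choices, not part of the original skill.

```python
import shutil
import subprocess


def lynx_command(url: str) -> list:
    # -dump: print the rendered page as text; -nolist: omit the link list at the end
    return ["lynx", "-dump", "-nolist", url]


def fetch_with_lynx(url: str, limit: int = 5000, timeout: float = 30.0):
    """Return rendered page text, or None when lynx is not on PATH."""
    if shutil.which("lynx") is None:
        return None  # caller can fall back to the curl + python3 extractor
    result = subprocess.run(
        lynx_command(url),
        capture_output=True,
        text=True,
        timeout=timeout,
        check=True,  # raise CalledProcessError if lynx exits with an error
    )
    return result.stdout[:limit]
```

Returning None instead of raising when lynx is missing keeps the "if installed" condition explicit for the caller.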
