nordabiz

Author	SHA1	Message	Date
Maciej Pienczyn	637ec2fc75	feat: Script to fix news images from Google News	2026-01-15 06:19:53 +01:00
Maciej Pienczyn	c2205b0815	fix: Corrected Google News URL decoding + use of source_domain	2026-01-15 06:10:59 +01:00
Maciej Pienczyn	8ead7798df	fix: Load DATABASE_URL from .env in the image script	2026-01-15 06:08:59 +01:00
Maciej Pienczyn	cf56fe7d8a	feat(zopk): Script to fetch images for news items Image fetching strategy: 1. Expand the Google News URL to the original source 2. Fetch og:image from the page's meta tags 3. Fallback: domain logo (Clearbit API) 4. Fallback: favicon (Google Favicon API) Usage: python scripts/fetch_news_images.py [--dry-run] [--limit N] Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-15 06:08:10 +01:00
Maciej Pienczyn	22e73e4f80	feat: Email DKIM/SPF/DMARC config + year_established data fill - Added release notes v1.19.0 with today's changes - Email: DKIM, SPF, DMARC configured for nordabiznes.pl - Data: year_established filled for 71/111 companies (64%) - Script: fix_year_established.py for KRS date migration Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-14 15:01:01 +01:00
Maciej Pienczyn	c8075e0872	feat: Add email test script for manual testing Script sends welcome emails to specified addresses for testing DKIM/SPF/DMARC configuration. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-14 14:09:17 +01:00
Maciej Pienczyn	3221740502	feat: Add NORDA Chamber joining date to the company profile - New member_since column in the companies table - "Członek Izby NORDA od" ("NORDA Chamber member since") card on the company profile (blue, #3b82f6) - Display the number of years in the Chamber - Import 57 joining dates from Artur's Excel file - import_member_since.py script for importing the dates Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-14 06:57:00 +01:00
Maciej Pienczyn	59c50e0267	fix: Handle None values in SEO audit result extraction Bug: When page fetch fails (SSL error), result['onpage'] is None. Using dict.get('key', {}) returns None when key exists with None value. Fix: Use 'or {}' pattern to handle both missing keys and None values. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 18:20:19 +01:00
Maciej Pienczyn	abe1cd38a1	feat: Add PKD codes and CEIDG owner data to company profiles - Add pkd_code, pkd_description columns for business activity classification - Add business_start_date column from CEIDG - Add owner_first_name, owner_last_name for JDG companies - Create import script scripts/import_ceidg_to_db.py - Add PKD card display in company profile template - Add owner section for JDG companies without KRS - Track SQL migrations in git (database/migrations/*.sql) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 16:07:03 +01:00
Maciej Pienczyn	f174f4d4da	feat: Link Users to Persons (KRS data) - Add person_id column to users table - Template shows person profile link when person_id exists - Add script to match and link users to persons by name Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 15:07:02 +01:00
Maciej Pienczyn	ffc6d8219f	feat: Add toggle button to hide/show test items on B2B board - Add is_test field to Classified model - Add test-item styling (opacity + gray border + badge) - Add yellow toggle button with localStorage persistence - Add script to mark existing classifieds as test	2026-01-13 13:08:11 +01:00
Maciej Pienczyn	08d6c0b069	feat: Add 'test' category for forum topics to separate test content - Add 'test' to ForumTopic.CATEGORIES with Polish label 'Testowy' - Add gray styling for test topics (badge + card opacity) - Add scripts to list and mark test topics	2026-01-13 11:48:08 +01:00
Maciej Pienczyn	9eae623d3e	feat: Add source tracking to events + import scripts - Add source and source_note fields to NordaEvent model - Create import_calendar_2026.py for NORDA calendar events - Create import_excel_members_2026_01_13.py for new members - Add .private/ to .gitignore (confidential materials) Imported 26 events from Kalendarz Izby NORDA 2026 (Artur Wiertel) Imported 31 new member companies from Excel Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-13 10:22:24 +01:00
Maciej Pienczyn	986360f7d5	feat: Add URL normalization and inline audit sections - Add normalize_social_url() function to database.py to prevent www vs non-www duplicates in social media records - Update update_social_media.py to normalize URLs before insert - Update social_media_audit.py to normalize URLs before insert - Add inline GBP Audit section to company profile - Add inline Social Media Audit section to company profile - Add inline IT Audit section to company profile Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 23:07:03 +01:00
Maciej Pienczyn	3f9273cff6	feat: Add company logos to search results, hide events section - Add company logo display in search results cards - Make logo clickable (links to company profile) - Temporarily hide "Aktualności i wydarzenia" section on company profiles - Add scripts for KRS PDF download/parsing and CEIDG API Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-11 15:32:53 +01:00
Maciej Pienczyn	f29987f635	auto-claude: 2.6 - Remove hardcoded password from docstring usage example	2026-01-10 12:55:34 +01:00
Maciej Pienczyn	c228716c0f	auto-claude: 2.6 - Replace hardcoded password in scripts/test_collaboration_matching.py with safe fallback	2026-01-10 12:54:39 +01:00
Maciej Pienczyn	914dac410e	auto-claude: 2.5 - Replace hardcoded password in scripts/seo_audit.py with safe fallback	2026-01-10 12:53:29 +01:00
Maciej Pienczyn	90f9401530	auto-claude: 2.4 - Replace hardcoded password in scripts/seo_report_generator.py with safe fallback	2026-01-10 12:52:01 +01:00
Maciej Pienczyn	b4dcca6d55	auto-claude: 2.3 - Replace hardcoded password in scripts/social_media_audit.py with safe fallback	2026-01-10 12:50:39 +01:00
Maciej Pienczyn	fa45b4b793	auto-claude: subtask-7-2 - Test collaboration matching Created comprehensive test suite for IT audit collaboration matching: 1. Unit tests (tests/test_it_audit_collaboration.py): - 12 tests verifying all 6 match types - Backup replication, shared licensing, Teams federation - Shared monitoring, collective purchasing, knowledge sharing - Edge cases for size parsing and similarity 2. Integration test script (scripts/test_collaboration_matching.py): - Creates test audits with matching criteria - Runs collaboration matching algorithm - Verifies matches saved to database All unit tests pass (12/12). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 09:24:45 +01:00
Maciej Pienczyn	39cd257f4e	Fix YouTube detection overwriting valid matches - Add 'channel', 'c', 'user', '@' etc. to YouTube exclusion list - Add 'bold_themes', 'boldthemes' to Twitter/Facebook exclusions (theme creators) - Fix pattern matching loop to stop after first valid match per platform - Prevents fallback pattern from overwriting correct channel ID with 'channel' Fixes issue where youtube.com/channel/ID was being overwritten with youtube.com/channel/channel by the second fallback pattern. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 05:36:06 +01:00
Maciej Pienczyn	c319777d58	Social Media audit: progress bar improvements - Add detailed logging to SocialMediaAuditor (website scan, Brave search, results) - Slow down progress bar animation (400ms instead of 200ms) for better readability - Bold "ZNALEZIONO" text for found platforms - Display Google rating and review count in progress - Increase wait time before modal close (4 seconds) - Add console.log for debugging audit response Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 05:29:17 +01:00
Maciej Pienczyn	8fed190303	fix(social-audit): Convert opening_hours dict to JSON for JSONB column Fixes: psycopg2.ProgrammingError: can't adapt type 'dict' Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-09 05:14:01 +01:00
Maciej Pienczyn	5ed97ac1dd	auto-claude: subtask-5-1 - Fix opening_hours and photos data passing in audit_company Fixed a bug where google_opening_hours and google_photos_count were being fetched from the Google Places API but not passed through to the result dictionary correctly: - Changed 'opening_hours' key to 'google_opening_hours' to match what save_audit_result() expects - Added 'google_photos_count' to the result dictionary Verified with dry-run: INPI company now shows opening hours schedule and 10 photos count from Google Business Profile. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 23:08:19 +01:00
Maciej Pienczyn	aacf2cf54b	auto-claude: subtask-2-3 - Update save_audit_result() to store google_opening_hours and google_photos_count - Added google_opening_hours and google_photos_count to INSERT column list - Added corresponding placeholders to VALUES list - Added to ON CONFLICT UPDATE SET clause - Added to parameter dictionary reading from google_reviews result Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 23:00:22 +01:00
Maciej Pienczyn	5f2cfa06fd	auto-claude: subtask-2-2 - Update get_place_details() to return photos count - Add google_photos_count to result dictionary initialization - Extract photos count from API response using len(place['photos']) - Update logging to include photos count in output Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 22:59:04 +01:00
Maciej Pienczyn	5fa80f9efa	auto-claude: subtask-2-1 - Add 'photos' to fields list in GooglePlacesSearcher Added 'photos' field to the fields list in get_place_details() method to enable fetching business photos from Google Places API. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 22:58:04 +01:00
Maciej Pienczyn	06c22539d7	auto-claude: subtask-4-2 - Add --company-slug support and dotenv loading - Add --company-slug argument to social_media_audit.py for easier testing - Add get_company_id_by_slug() method to SocialMediaAuditor class - Add python-dotenv support to load .env file from project root - Create verify_google_places.py script for direct API testing Note: Full verification blocked - current API key (PageSpeed) doesn't have Places API enabled. Requires enabling Places API in Google Cloud Console for project NORDABIZNES (gen-lang-client-0540794446). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 20:49:59 +01:00
Maciej Pienczyn	3d69d53550	auto-claude: subtask-3-4 - Run all tests and verify they pass Fixed bug in social media exclusion logic that was too aggressive. The substring check `any(ex in match.lower() for ex in excludes)` was incorrectly excluding valid usernames containing exclusion strings (e.g., 'testcompany' was excluded because it contained 'p'). Changed to exact match only to properly handle Instagram post URLs (`instagram.com/p/...`) without false positives on valid usernames. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 20:41:54 +01:00
Maciej Pienczyn	3bdbde1621	auto-claude: subtask-2-3 - Update SocialMediaAuditor to use GooglePlacesSearcher - Add google_places_searcher attribute to SocialMediaAuditor - Initialize GooglePlacesSearcher if GOOGLE_PLACES_API_KEY env var is set - Update audit_company() to use Places API directly when available - Fallback to Brave Search when API key not configured - Log which data source is being used for reviews Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 20:31:22 +01:00
Maciej Pienczyn	b389287697	auto-claude: subtask-2-2 - Replace placeholder search_google_reviews() method Implemented actual Google reviews data collection in BraveSearcher class: - Uses GooglePlacesSearcher to find company and get place details - Returns google_rating, google_reviews_count, opening_hours, business_status - Falls back to Brave Search API parsing when Google API key not available - Added _search_brave_for_reviews() helper for fallback implementation - Proper error handling and logging throughout Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 20:29:50 +01:00
Maciej Pienczyn	4110ef63b5	auto-claude: subtask-2-1 - Add GooglePlacesSearcher class to social_media_audit.py Implements GooglePlacesSearcher class with: - find_place() method: searches for business by name and city using Google Places findplacefromtext API - get_place_details() method: retrieves rating, review count, opening hours, business status, phone, and website Features: - Uses GOOGLE_PLACES_API_KEY environment variable - Comprehensive error handling (timeout, request errors) - Polish language locale support - Follows existing BraveSearcher class pattern Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 20:27:49 +01:00
Maciej Pienczyn	af003798a7	Fix database connection in SEO scripts (localhost instead of external IP)	2026-01-08 15:57:39 +01:00
Maciej Pienczyn	feaf5d5a49	auto-claude: 8.2 - Fix SQL ANY() to IN() for SQLite compatibility - Changed PostgreSQL-specific ANY(:ids) to use IN clause with dynamic placeholders for SQLite/PostgreSQL compatibility - Verified SEO audit dry-run extracts all metrics correctly: - HTTP status, load time, final URL - Meta title, H1 count, image analysis - Structured data detection - robots.txt, sitemap.xml, indexability - Overall SEO score calculation (95 for pixlab.pl) Note: Company ID 26 has no website configured, tested with ID 1 instead. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 09:21:12 +01:00
Maciej Pienczyn	15ddbba8b5	auto-claude: 7.1 - Create scripts/seo_report_generator.py that generates HTML reports and JSON exports Features: - Single company HTML reports with full SEO audit data - Batch HTML summary reports for multiple companies - JSON exports for integration with other tools - SEO recommendations based on audit findings - CLI interface with --company-id, --batch, --all selection - Output format options: --html, --json - Score visualization with color-coded badges - Core Web Vitals section with threshold indicators - Issues and recommendations sections - Statistics calculation for batch reports - Polish language support in reports Usage examples: - python seo_report_generator.py --company-id 26 --html - python seo_report_generator.py --all --html --output ./reports - python seo_report_generator.py --batch 1-10 --json 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 09:13:18 +01:00
Maciej Pienczyn	c24c545cfe	auto-claude: 4.3 - Add save_audit_result method with ON CONFLICT DO UPDATE - Enhanced save_audit_result method with complete column coverage - Added missing columns to idempotent upsert query: - broken_links_count (for future link checking) - viewport_configured (derived from meta viewport tag) - is_mobile_friendly (derived from viewport content) - has_hreflang (for international SEO detection) - All 45+ SEO columns now properly mapped for database upserts - ON CONFLICT (company_id) DO UPDATE ensures idempotent operations 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 08:00:12 +01:00
Maciej Pienczyn	c8eb0829d9	auto-claude: 4.2 - Add CLI argument parsing, progress logging, and error handling Enhanced scripts/seo_audit.py with comprehensive CLI improvements: CLI Arguments: - --company-id: Audit single company by ID - --company-ids: Audit multiple companies (comma-separated) - --batch: Audit range of companies (e.g., 1-10) - --all: Audit all companies - --dry-run: Print results without database writes - --verbose/-v: Debug output - --quiet/-q: Suppress progress output - --json: JSON output for scripting - --database-url: Override DATABASE_URL env var Progress Logging: - ETA calculation based on average time per company - Progress counter [X/Y] for each company - Status indicators (SUCCESS/SKIPPED/FAILED/TIMEOUT) Summary Reporting: - Detailed breakdown by result category - Edge case counts (no_website, unavailable, timeout, ssl_errors) - PageSpeed API quota tracking (start/used/remaining) - Visual score distribution with bar charts - Failed audits listing with error messages Error Handling: - Proper exit codes (0-5) for different scenarios - Categorization of errors (timeout, connection, SSL, unavailable) - Database connection error handling - Quota exceeded handling - Batch argument validation with helpful error messages 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 03:11:22 +01:00
Maciej Pienczyn	2bebb46f02	auto-claude: 4.1 - Create scripts/seo_audit.py with SEOAuditor class Implements SEOAuditor class following social_media_audit.py pattern: - __init__: Initialize database connection and analysis components - get_companies: Fetch companies by ID, batch, or all - audit_company: Full SEO audit (PageSpeed, on-page, technical) - save_audit_result: Upsert to company_website_analysis table - run_audit: Orchestration with progress logging and summary Features: - Integrates GooglePageSpeedClient for Lighthouse scores - Uses OnPageSEOAnalyzer for meta tags, headings, images, links - Uses TechnicalSEOChecker for robots.txt, sitemap, canonical - Calculates overall SEO score from weighted components - CLI support: --company-id, --batch, --all, --dry-run, --json 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 02:16:36 +01:00
Maciej Pienczyn	81fc27dfa9	auto-claude: 3.2 - Add TechnicalSEOChecker class to scripts/seo_analyzer.py Adds TechnicalSEOChecker class that performs technical SEO audits: - robots.txt: checks existence, parses directives (Disallow, Allow, Sitemap) detects if blocks Googlebot or all bots - sitemap.xml: checks existence, validates XML, counts URLs, detects sitemap index - Canonical URLs: detects canonical tag, checks if self-referencing or cross-domain - Noindex tags: checks meta robots and X-Robots-Tag HTTP header - Redirect chains: follows up to 10 redirects, detects loops, HTTPS upgrades, www redirects, and mixed content issues Includes: - 8 dataclasses for structured results (RobotsTxtResult, SitemapResult, etc.) - TechnicalSEOResult container for complete analysis - check_technical_seo() convenience function - CLI support: --technical/-t flag for technical-only analysis - --all/-a flag for combined on-page and technical analysis - --json/-j flag for JSON output 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 02:12:47 +01:00
Maciej Pienczyn	0c257f5e48	auto-claude: 3.1 - Create scripts/seo_analyzer.py with OnPageSEOAnalyzer Add comprehensive on-page SEO analyzer that extracts: - Meta tags (title, description, keywords, robots, viewport, canonical) - Open Graph metadata (og:title, og:description, og:image, etc.) - Twitter Card metadata (card type, site, creator, etc.) - Heading structure (h1-h6 counts, hierarchy validation) - Image alt text analysis (missing, empty, quality issues) - Link analysis (internal/external/nofollow/broken) - Structured data detection (JSON-LD, Microdata, RDFa) - Word count and document attributes (DOCTYPE, lang) Uses dataclasses for structured results following pagespeed_client.py pattern. Includes CLI interface for testing individual URLs. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 02:07:10 +01:00
Maciej Pienczyn	9f58e3f8e1	auto-claude: 2.1 - Create scripts/pagespeed_client.py with GooglePageSpeedClient Implements Google PageSpeed Insights API client with: - GooglePageSpeedClient class for making API calls - Exponential backoff retry logic (3 retries, 1-60s backoff) - RateLimiter class with daily quota tracking (25k req/day) - Quota persistence to .pagespeed_quota.json - Support for mobile/desktop strategies - Core Web Vitals extraction (LCP, FCP, CLS, TTFB) - Lighthouse audit scores (performance, accessibility, SEO, best-practices) - Structured dataclasses for results (PageSpeedResult, PageSpeedScore, CoreWebVitals) - Custom exceptions (QuotaExceededError, RateLimitError, PageSpeedAPIError) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-08 02:00:37 +01:00
Maciej Pienczyn	02fc67bf40	Initial commit	2026-01-01 14:01:49 +01:00
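
Commit 59c50e0267 describes a pitfall that is easy to miss: `dict.get('key', {})` only uses its default when the key is absent, not when the key is present with a `None` value. The snippet below is a minimal sketch of that behaviour and of the `or {}` pattern the commit message names. The `result` dictionary and its field names (`technical`, `has_sitemap`, `meta_title`, `pagespeed`) are hypothetical stand-ins, not the repository's actual data.

```python
# Hypothetical audit result: the page fetch failed, so 'onpage' exists but is None.
result = {"onpage": None, "technical": {"has_sitemap": True}}

# dict.get() falls back to the default only when the key is missing.
# Here the key exists, so the stored None is returned and a chained
# naive.get(...) would raise AttributeError.
naive = result.get("onpage", {})

# 'or {}' swaps any falsy value (None included) for an empty dict.
onpage = result.get("onpage") or {}
meta_title = onpage.get("meta_title")  # safe lookup, yields None

# The same pattern also covers a key that is missing altogether.
pagespeed = result.get("pagespeed") or {}
```

One trade-off of this pattern: any other falsy value (an empty list, `0`) is also replaced by `{}`, which is fine when the field is expected to be a dict or `None`.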


143 Commits
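
Commit 3d69d53550 replaces a substring-based exclusion check with an exact match. The sketch below illustrates why, assuming a hypothetical exclusion list and two hypothetical helper functions; only the expression `any(ex in match.lower() for ex in excludes)` and the `'testcompany'` / `'p'` example come from the commit message itself.

```python
# Illustrative exclusion list: path segments that are not usernames,
# such as the 'p' in instagram.com/p/... post URLs.
EXCLUDES = ["p", "explore", "reel"]


def excluded_by_substring(match: str, excludes: list[str]) -> bool:
    """Old behaviour: excluded if ANY exclusion string occurs inside the match."""
    return any(ex in match.lower() for ex in excludes)


def excluded_by_exact_match(match: str, excludes: list[str]) -> bool:
    """Fixed behaviour: excluded only if the whole match equals an exclusion string."""
    return match.lower() in excludes


# 'testcompany' contains the letter 'p', so the substring check rejects it,
# while the exact-match check keeps it and still rejects a bare 'p'.
substring_result = excluded_by_substring("testcompany", EXCLUDES)
exact_result = excluded_by_exact_match("testcompany", EXCLUDES)
post_segment_result = excluded_by_exact_match("p", EXCLUDES)
```

Short exclusion strings make the substring variant especially prone to false positives, since a one-letter entry matches almost any username.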