feat: guidelines

feat: add logs
feat(pierre): sanity check
2026-02-13 18:33:21 +01:00 · 2026-02-13 18:04:09 +01:00 · 2026-02-13 17:35:39 +01:00 · 2026-02-13 17:09:18 +01:00 · 2026-02-13 14:31:20 +01:00 · 2026-02-13 09:30:02 +01:00
25 changed files with 1408 additions and 214 deletions
@@ -0,0 +1,171 @@
+---
+layout: default
+---
+
+# AGENTS.md – Repository Guidelines for Pierre Bot
+
+> This document is consulted by autonomous coding agents (including the
+> Open‑Source `opencode` agent) when they read, modify, test, or lint the
+> code‑base. It contains the canonical commands for building, testing, and
+> lint‑ing the project as well as a concise style guide that all Go code must
+> follow. The file is deliberately kept around **150 lines** to be readable for
+> both humans and LLMs.
+
+---
+
+## 1️⃣ Build / Run Commands
+
+| Command                                       | Description                                         |
+| --------------------------------------------- | --------------------------------------------------- |
+| `go build -o bin/pierre ./cmd/pierre/main.go` | Build the binary into `./bin/pierre`.               |
+| `go run ./cmd/pierre/main.go --help`          | Run the CLI with the help flag (fast sanity check). |
+| `./bin/pierre --version`                      | Verify the built binary reports its version.        |
+
+> **Tip for agents** – always prefer the `./bin/pierre` binary over invoking `go run` in CI or tests; it guarantees the same compilation flags.
+
+---
+
+## 2️⃣ Test Commands
+
+The project uses the standard Go testing framework.
+
+| Command                               | Use case                                                             |
+| ------------------------------------- | -------------------------------------------------------------------- |
+| `go test ./...`                       | Run **all** unit and integration tests.                              |
+| `go test ./... -run ^TestJudgePR$`    | Run a **single** test (replace `TestJudgePR` with the desired name). |
+| `go test -cover ./...`                | Run tests and emit a coverage report.                                |
+| `go test -run ^TestName$ -count=1 -v` | Verbose, non‑cached run for debugging a single test.                 |
+| `go test ./... -bench .`              | Execute benchmarks (if any).                                         |
+
+> **Agents should never** invoke `go test` with `-parallel` flags; the default parallelism is sufficient and ensures deterministic ordering for our table‑driven tests.
+
+---
+
+## 3️⃣ Lint / Static Analysis
+
+We rely on two lightweight built‑in tools plus optional `golangci-lint` when
+available.
+
+| Tool                       | Command             | Description                                                                                                                                                 |
+| -------------------------- | ------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `go vet ./...`             | `go vet ./...`      | Detect obvious bugs, misuse of `fmt.Printf`, etc.                                                                                                           |
+| `staticcheck ./...`        | `staticcheck ./...` | Deeper static analysis (must be installed via `go install honnef.co/go/tools/cmd/staticcheck@latest`).                                                      |
+| `golangci-lint` (optional) | `golangci-lint run` | Runs a configurable suite of linters. Install with `brew install golangci-lint` or `go install github.com/golangci/golangci-lint/cmd/golangci-lint@latest`. |
+
+**Agent tip** – when `golangci-lint` is present, run it after `go vet` to catch style
+issues early. The CI pipeline (see `.github/workflows/*.yml` if added later) will
+use the same commands.
+
+---
+
+## 4️⃣ Code‑Style Guidelines (Go)
+
+All changes must satisfy the following conventions. They are enforced by
+`go fmt`, `go vet`, and the optional `golangci-lint` config. The code does not have
+to be backwards compatible.
+
+### 4.1 Formatting & Imports
+
+- Run `go fmt ./...` before committing. The repository uses **tab‑indented** Go code.
+- Imports are grouped in three blocks, separated by a blank line:
+  1. **Standard library**
+  2. **External dependencies** (e.g., `github.com/alecthomas/kong`)
+  3. **Internal packages** (e.g., `github.com/yourorg/pierre/internal/...`)
+- Within each block, imports are sorted alphabetically.
+- Use `goimports` (or `golangci‑lint --fast`) to automatically fix import ordering.
+
+### 4.2 Naming Conventions
+
+| Entity                | Rule                                                                   |
+| --------------------- | ---------------------------------------------------------------------- |
+| Packages              | lower‑case, no underscores or hyphens.                                 |
+| Files                 | snake_case (`*_test.go` for tests).                                    |
+| Types / Structs       | PascalCase, descriptive (e.g., `ReviewConfig`).                        |
+| Interfaces            | End with `er` when appropriate (e.g., `ChatAdapter`).                  |
+| Variables / Constants | camelCase for locals, PascalCase for exported.                         |
+| Functions             | PascalCase if exported, camelCase if private.                          |
+| Constants             | Use `CamelCase` for groups, `CONST_NAME` only for truly global values. |
+| Test Functions        | `TestXxx` and optionally `BenchmarkXxx`.                               |
+
+### 4.3 Error Handling
+
+- Errors are **never** ignored. Use the blank identifier only when the value is truly irrelevant.
+- Wrap contextual information using `fmt.Errorf("context: %w", err)` or `errors.Wrap` if the project imports `github.com/pkg/errors` (currently not used).
+- Return errors **as soon as** they are detected – "guard clause" style.
+- In public APIs, prefer error values over panics. Panics are limited to unrecoverable
+  programming errors (e.g., nil‑pointer dereference in `init`).
+
+### 4.4 Documentation & Comments
+
+- Exported identifiers **must** have a preceding doc comment beginning with the name (e.g., `// JudgePR reviews a PR and returns comments.`).
+- Inline comments should be sentence‑case and end with a period.
+- Use `//go:generate` directives sparingly; they must be accompanied by a test that ensures the generated file is up‑to‑date.
+
+### 4.5 Testing Practices
+
+- Keep tests **table‑driven** where possible; this yields concise, readable test suites.
+- Use the `testing` package only; avoid third‑party test frameworks.
+- Prefer `t.Fatalf` for fatal errors and `t.Errorf` for non‑fatal assertions.
+- When comparing complex structs, use `github.com/google/go-cmp/cmp` (already in `go.mod`).
+- Test coverage should be **≥ 80 %** for new code.
+- All tests must be **deterministic** – no reliance on external services; use mocks or fakes.
+
+### 4.6 Dependency Management
+
+- All dependencies are managed via Go modules (`go.mod`).
+- Run `go mod tidy` after adding/removing imports.
+- Do not commit `vendor/` unless a vendoring strategy is explicitly adopted.
+
+### 4.7 Logging & Output
+
+- Use the standard library `log` package for user‑visible output.
+- For structured logs (debug mode), wrap with a small helper that respects the `--debug` flag.
+- CLI output must be **machine‑parsable** when `--json-output` is set (future feature).
+
+---
+
+## 5️⃣ Git & Pull‑Request Workflow (for agents)
+
+1. **Branch naming** – `feature/<short‑desc>` or `bugfix/<short‑desc>`.
+2. **Commits** – One logical change per commit, with a concise subject line (<50 chars) and an optional body explaining _why_ the change was needed.
+3. **CI** – Every PR must pass `go test ./...`, `go vet ./...`, and `golangci‑lint run` (if configured).
+4. **Review** – Agents should add comments only via the LLM; do not edit code generated by the LLM unless a test fails.
+5. **Rebasing** – Keep a linear history; use `git rebase -i` locally before merging.
+
+---
+
+## 6️⃣ Cursor / Copilot Rules (None Present)
+
+The repository does not contain a `.cursor` directory or a `.github/copilot‑instructions.md`
+file. If such files are added in the future, agents should read them and incorporate
+any additional constraints into this document.
+
+---
+
+## 7️⃣ Quick Reference Cheat‑Sheet (for agents)
+
+```bash
+# Build
+go build -o bin/pierre ./cmd/pierre/main.go
+
+# Run single test (replace TestFoo)
+go test ./... -run ^TestFoo$ -v
+
+# Lint & Vet
+go vet ./...
+staticcheck ./...
+# optional
+golangci-lint run
+
+# Format & Imports
+go fmt ./...
+goimports -w .
+
+# Module tidy
+go mod tidy
+```
+
+---
+
+_End of AGENTS.md_
+
@@ -1,77 +0,0 @@
-# Pierre Bot
-
-Pierre Bot is an intelligent, AI-powered code review assistant designed for Bitbucket Server/Data Center. It automates the initial pass of code review by analyzing Pull Request diffs and identifying potential bugs, logic errors, and style issues using modern LLMs (Google Gemini 2.0 Flash or Ollama).
-
-## Project Overview
-
-*   **Type:** Go CLI Application
-*   **Core Function:** Fetches PR diffs from Bitbucket -> Sends to LLM -> Prints structured review comments.
-*   **Key Technologies:**
-    *   **Language:** Go (1.25+)
-    *   **AI SDKs:** `google/generative-ai-go`, `ollama/ollama`
-    *   **CLI Framework:** `alecthomas/kong`
-
-## Architecture
-
-The project follows a standard Go project layout:
-
-*   **`cmd/pierre/`**: Contains the `main.go` entry point. It handles configuration parsing (flags, env vars, file), initializes adapters, and orchestrates the application flow.
-*   **`internal/pierre/`**: Contains the core business logic.
-    *   `judge.go`: Defines the `JudgePR` function which prepares the system prompt and context for the LLM.
-*   **`internal/chatter/`**: Abstraction layer for LLM providers.
-    *   `gemini.go`: Implements the `ChatAdapter` interface for Google Gemini. notably includes **dynamic JSON schema generation** via reflection (`schemaFromType`) to enforce structured output from the model.
-    *   `ollama.go`: Implements the `ChatAdapter` for Ollama (local models).
-*   **`internal/gitadapters/`**: Abstraction for Version Control Systems.
-    *   `bitbucket.go`: Client for fetching PR diffs from Bitbucket Server.
-
-## Building and Running
-
-### Prerequisites
-
-*   Go 1.25 or later
-*   Access to a Bitbucket Server instance
-*   API Key for Google Gemini (or a running Ollama instance)
-
-### Build
-
-```bash
-go build -o pierre ./cmd/pierre/main.go
-```
-
-### Configuration
-
-Configuration is handled via `kong` and supports a hierarchy: **Flags > Env Vars > Config File**.
-
-**1. Environment Variables:**
-
-*   `BITBUCKET_URL`: Base URL of the Bitbucket instance.
-*   `BITBUCKET_TOKEN`: Personal Access Token (HTTP) for Bitbucket.
-*   `LLM_PROVIDER`: `gemini` or `ollama`.
-*   `LLM_API_KEY`: API Key for Gemini.
-*   `LLM_MODEL`: Model name (e.g., `gemini-2.0-flash`).
-
-**2. Configuration File (`config.yaml`):**
-
-See `config.example.yaml` for a template. Place it in the current directory or `~/.pierre.yaml`.
-
-### Usage
-
-Run the bot against a specific Pull Request:
-
-```bash
-# Syntax: ./pierre [flags] <PROJECT_KEY> <REPO_SLUG> <PR_ID>
-./pierre --llm-provider=gemini --llm-model=gemini-2.0-flash MYPROJ my-repo 123
-```
-
-## Development Conventions
-
-*   **Structured Output:** The bot relies on the LLM returning valid JSON matching the `Comment` struct. This is enforced in `internal/chatter/gemini.go` by converting the Go struct definition into a `genai.Schema`.
-*   **Dependency Injection:** Adapters (`gitadapters`, `chatter`) are initialized in `main` and passed to the core logic, making testing easier.
-*   **Error Handling:** strict error checks are preferred; the bot will exit if it cannot fetch the diff or initialize the AI.
-
-## Key Files
-
-*   **`cmd/pierre/main.go`**: Application entry point and config wiring.
-*   **`internal/pierre/judge.go`**: The "brain" that constructs the prompt for the AI.
-*   **`internal/chatter/gemini.go`**: Gemini integration logic, including the reflection-based schema generator.
-*   **`config.example.yaml`**: Reference configuration file.
@@ -8,7 +8,7 @@ It fetches pull request diffs, analyzes them using Google's Gemini 2.0 Flash mod
 Ensure you have [Go](https://go.dev/) installed, then clone the repository and build the binary:

 ```bash
-git clone https://bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot.git
+git clone https://git.schreifuchs.ch/schreifuchs/pierre-bot.git
 cd pierre-bot
 go build -o pierre ./cmd/pierre/main.go

@@ -2,15 +2,16 @@ package main

 import (
 	"context"
-	"fmt"
 	"log"
+	"log/slog"
 	"os"
 	"path/filepath"
+	"strings"

-	"bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot/internal/chatter"
-	"bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot/internal/gitadapters"
-	"bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot/internal/gitadapters/gitea"
-	"bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot/internal/pierre"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/gitadapters/bitbucket"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/gitadapters/gitea"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/pierre"
 	"github.com/alecthomas/kong"
 	kongyaml "github.com/alecthomas/kong-yaml"
 )
@@ -33,12 +34,27 @@ type RepoArgs struct {

 type LLMConfig struct {
 	Provider string `help:"Provider for llm (ollama or gemini)" required:"" env:"LLM_PROVIDER"`
-	Endpoint string `help:"Endpoint for provider (only for ollama)" env:"LLM_ENDPOINT"`
+	BaseURL  string `help:"Endpoint for provider (only for ollama)" env:"LLM_BASE_URL"`
 	APIKey   string `help:"APIKey for provider" env:"LLM_API_KEY"`
 	Model    string `help:"Model to use" env:"LLM_MODEL"`
 }

+// ReviewConfig holds the review‑specific CLI options.
+// The `default:"60000"` tag sets an integer default of 60 KB – Kong parses the string value into the int field, which can be confusing for readers.
+type ReviewConfig struct {
+	MaxChunkSize    int      `help:"Maximum diff chunk size in bytes" default:"60000"`
+	Guidelines      []string `help:"Project guidelines to prepend" sep:","`
+	DisableComments bool     `help:"Disable posting comments (dry run)"`
+	SanityCheck     bool     `help:"Run sanity‑check LLM prompts per comment" default:"true"`
+}
+
 type Config struct {
+	LogLevel string `help:"Log verbosity: debug, info, warn, error" default:"info"`
+
+	// Embedding ReviewConfig with a prefix changes flag names to `--review-…`.
+	// Existing configuration files using the old flag names will need to be updated.
+	// Consider keeping backwards compatibility if required.
+	Review      ReviewConfig    `embed:"" prefix:"review-"`
 	GitProvider string          `help:"Git provider (bitbucket or gitea)" env:"GIT_PROVIDER"`
 	Bitbucket   BitbucketConfig `embed:"" prefix:"bitbucket-"`
 	Gitea       GiteaConfig     `embed:"" prefix:"gitea-"`
@@ -57,13 +73,31 @@ func main() {
 	defaultConfig := filepath.Join(home, ".config", "pierre", "config.yaml")

 	// Parse flags, env vars, and config files
-	ctx := kong.Parse(cfg,
+	kong.Parse(cfg,
 		kong.Name("pierre"),
 		kong.Description("AI-powered Pull Request reviewer"),
 		kong.UsageOnError(),
 		kong.Configuration(kongyaml.Loader, "config.yaml", defaultConfig),
 	)

+	// Configure global slog logger based on the --log-level flag
+	lvl := slog.LevelInfo
+	switch strings.ToLower(cfg.LogLevel) {
+	case "debug":
+		lvl = slog.LevelDebug
+	case "info":
+		lvl = slog.LevelInfo
+	case "warn":
+		lvl = slog.LevelWarn
+	case "error":
+		lvl = slog.LevelError
+	}
+
+	// Logs are sent to stderr so that stdout can be safely piped.
+	// The review output (fmt.Printf) stays on stdout unchanged.
+	logger := slog.New(slog.NewTextHandler(os.Stderr, &slog.HandlerOptions{Level: lvl}))
+	slog.SetDefault(logger)
+
 	// Auto-detect provider
 	provider := cfg.GitProvider
 	if provider == "" {
@@ -78,18 +112,20 @@ func main() {
 		}
 	}

-	var adapter gitadapters.Adapter
+	var git pierre.GitAdapter
+
 	switch provider {
 	case "bitbucket":
 		if cfg.Bitbucket.BaseURL == "" {
 			log.Fatal("Bitbucket Base URL is required when using bitbucket provider.")
 		}
-		adapter = gitadapters.NewBitbucket(cfg.Bitbucket.BaseURL, cfg.Bitbucket.Token)
+		git = bitbucket.NewBitbucket(cfg.Bitbucket.BaseURL, cfg.Bitbucket.Token)
 	case "gitea":
 		if cfg.Gitea.BaseURL == "" {
 			log.Fatal("Gitea Base URL is required when using gitea provider.")
 		}
-		adapter, err = gitea.New(cfg.Gitea.BaseURL, cfg.Gitea.Token)
+		var err error
+		git, err = gitea.New(cfg.Gitea.BaseURL, cfg.Gitea.Token)
 		if err != nil {
 			log.Fatalf("Error initializing Gitea adapter: %v", err)
 		}
@@ -97,20 +133,17 @@ func main() {
 		log.Fatalf("Unknown git provider: %s", provider)
 	}

-	// Fetch Diff using positional args from shared RepoArgs
-	diff, err := adapter.GetDiff(cfg.Repo.Owner, cfg.Repo.Repo, cfg.Repo.PRID)
-	if err != nil {
-		log.Fatalf("Error fetching diff: %v", err)
-	}
-
 	// Initialize AI Adapter
-	var ai chatter.ChatAdapter
+	var ai pierre.ChatAdapter

 	switch cfg.LLM.Provider {
 	case "gemini":
 		ai, err = chatter.NewGeminiAdapter(context.Background(), cfg.LLM.APIKey, cfg.LLM.Model)
 	case "ollama":
-		ai, err = chatter.NewOllamaAdapter(cfg.LLM.Endpoint, cfg.LLM.Model)
+		ai, err = chatter.NewOllamaAdapter(cfg.LLM.BaseURL, cfg.LLM.Model)
+	case "openai":
+		ai = chatter.NewOpenAIAdapter(cfg.LLM.APIKey, cfg.LLM.Model, cfg.LLM.BaseURL)
+
 	default:
 		log.Fatalf("%s is not a valid llm provider", cfg.LLM.Provider)
 	}
@@ -119,18 +152,9 @@ func main() {
 		log.Fatalf("Error initializing AI: %v", err)
 	}

-	// Run Logic
-	comments, err := pierre.JudgePR(context.Background(), ai, diff)
-	if err != nil {
-		log.Fatalf("Error judging PR: %v", err)
+	pierreService := pierre.New(ai, git, cfg.Review.MaxChunkSize, cfg.Review.Guidelines, cfg.Review.DisableComments)
+	pierreService.SetSanityCheck(cfg.Review.SanityCheck)
+	if err := pierreService.MakeReview(context.Background(), cfg.Repo.Owner, cfg.Repo.Repo, cfg.Repo.PRID); err != nil {
+		log.Fatalf("Error during review: %v", err)
 	}
-
-	fmt.Printf("Analysis complete. Found %d issues.\n---\n", len(comments))
-
-	for _, c := range comments {
-		fmt.Printf("File: %s\nLine: %d\nMessage: %s\n%s\n",
-			c.File, c.Line, c.Message, "---")
-	}
-
-	_ = ctx
 }
@@ -1,4 +1,4 @@
-module bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot
+module git.schreifuchs.ch/schreifuchs/pierre-bot

 go 1.25.0

@@ -7,7 +7,9 @@ require (
 	github.com/alecthomas/kong v1.14.0
 	github.com/alecthomas/kong-yaml v0.2.0
 	github.com/google/generative-ai-go v0.20.1
+	github.com/google/go-cmp v0.7.0
 	github.com/ollama/ollama v0.16.0
+	github.com/sashabaranov/go-openai v1.41.2
 	google.golang.org/api v0.186.0
 )

@@ -100,6 +100,8 @@ github.com/ollama/ollama v0.16.0/go.mod h1:FEk95NbAJJZk+t7cLh+bPGTul72j1O3PLLlYN
 github.com/pmezard/go-difflib v1.0.0 h1:4DBwDE0NGyQoBHbLQYPwSUPoCMWR5BEzIk/f1lZbAQM=
 github.com/pmezard/go-difflib v1.0.0/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
 github.com/prometheus/client_model v0.0.0-20190812154241-14fe0d1b01d4/go.mod h1:xMI15A0UPsDsEKsMN9yxemIoYk6Tm2C1GtYGdfGttqA=
+github.com/sashabaranov/go-openai v1.41.2 h1:vfPRBZNMpnqu8ELsclWcAvF19lDNgh1t6TVfFFOPiSM=
+github.com/sashabaranov/go-openai v1.41.2/go.mod h1:lj5b/K+zjTSFxVLijLSTDZuP7adOgerWeFyZLUhAKRg=
 github.com/stretchr/objx v0.1.0/go.mod h1:HFkY916IF+rwdDfMAkV7OtwuqBVzrE8GR6GFx+wExME=
 github.com/stretchr/objx v0.4.0/go.mod h1:YvHI0jy2hoMjB+UWwv71VJQ9isScKT/TqJzVSSt89Yw=
 github.com/stretchr/objx v0.5.0/go.mod h1:Yh+to48EsGEfYuaHDzXPcE3xhTkx73EhmCGUpEOglKo=
@@ -1,9 +1,5 @@
 package chatter

-import (
-	"context"
-)
-
 // Role defines who sent the message
 type Role string

@@ -19,10 +15,6 @@ type Message struct {
 	Content string
 }

-type ChatAdapter interface {
-	GenerateStructured(ctx context.Context, messages []Message, target interface{}) error
-}
-
 // PredictionConfig allows for per-request overrides
 type PredictionConfig struct {
 	Temperature float64
@@ -0,0 +1,184 @@
+package chatter
+
+import (
+	"context"
+	"crypto/tls"
+	"encoding/json"
+	"fmt"
+	"net/http"
+	"reflect"
+
+	"github.com/sashabaranov/go-openai"
+	"github.com/sashabaranov/go-openai/jsonschema"
+)
+
+type OpenAIAdapter struct {
+	client *openai.Client
+	model  string
+}
+
+func NewOpenAIAdapter(apiKey string, model string, baseURL string) *OpenAIAdapter {
+	config := openai.DefaultConfig(apiKey)
+
+	tr := &http.Transport{
+		TLSClientConfig: &tls.Config{
+			InsecureSkipVerify: true, // Bypasses the "not standards compliant" error
+		},
+	}
+
+	config.HTTPClient = &http.Client{Transport: tr}
+
+	if baseURL != "" {
+		config.BaseURL = baseURL
+	}
+	if baseURL != "" {
+		config.BaseURL = baseURL
+	}
+	return &OpenAIAdapter{
+		client: openai.NewClientWithConfig(config),
+		model:  model,
+	}
+}
+
+func (a *OpenAIAdapter) Generate(ctx context.Context, messages []Message) (string, error) {
+	var chatMsgs []openai.ChatCompletionMessage
+	for _, m := range messages {
+		chatMsgs = append(chatMsgs, openai.ChatCompletionMessage{
+			Role:    string(m.Role),
+			Content: m.Content,
+		})
+	}
+
+	resp, err := a.client.CreateChatCompletion(ctx, openai.ChatCompletionRequest{
+		Model:    a.model,
+		Messages: chatMsgs,
+	})
+	if err != nil {
+		return "", err
+	}
+	return resp.Choices[0].Message.Content, nil
+}
+
+func (a *OpenAIAdapter) GenerateStructured(ctx context.Context, messages []Message, target any) error {
+	val := reflect.ValueOf(target)
+	if val.Kind() != reflect.Ptr {
+		return fmt.Errorf("target must be a pointer")
+	}
+
+	elem := val.Elem()
+	var schemaType reflect.Type
+	isSlice := elem.Kind() == reflect.Slice
+
+	// 1. Wrap slices in an object because OpenAI requires a root object
+	if isSlice {
+		schemaType = reflect.StructOf([]reflect.StructField{
+			{
+				Name: "Items",
+				Type: elem.Type(),
+				Tag:  `json:"items"`,
+			},
+		})
+	} else {
+		schemaType = elem.Type()
+	}
+
+	// 2. Build the Schema Map
+	schemaObj := a.reflectTypeToSchema(schemaType)
+
+	// 3. Convert to json.RawMessage to satisfy the json.Marshaler interface
+	schemaBytes, err := json.Marshal(schemaObj)
+	if err != nil {
+		return fmt.Errorf("failed to marshal schema: %w", err)
+	}
+
+	var chatMsgs []openai.ChatCompletionMessage
+	for _, m := range messages {
+		chatMsgs = append(chatMsgs, openai.ChatCompletionMessage{
+			Role:    string(m.Role),
+			Content: m.Content,
+		})
+	}
+
+	// 4. Send Request
+	req := openai.ChatCompletionRequest{
+		Model:    a.model,
+		Messages: chatMsgs,
+		ResponseFormat: &openai.ChatCompletionResponseFormat{
+			Type: openai.ChatCompletionResponseFormatTypeJSONSchema,
+			JSONSchema: &openai.ChatCompletionResponseFormatJSONSchema{
+				Name:   "output_schema",
+				Strict: true,
+				Schema: json.RawMessage(schemaBytes),
+			},
+		},
+	}
+
+	resp, err := a.client.CreateChatCompletion(ctx, req)
+	if err != nil {
+		return err
+	}
+
+	content := resp.Choices[0].Message.Content
+
+	// 5. Unmarshal and Unwrap if necessary
+	if isSlice {
+		temp := struct {
+			Items json.RawMessage `json:"items"`
+		}{}
+		if err := json.Unmarshal([]byte(content), &temp); err != nil {
+			return err
+		}
+		return json.Unmarshal(temp.Items, target)
+	}
+
+	return json.Unmarshal([]byte(content), target)
+}
+
+func (a *OpenAIAdapter) reflectTypeToSchema(t reflect.Type) jsonschema.Definition {
+	for t.Kind() == reflect.Ptr {
+		t = t.Elem()
+	}
+
+	switch t.Kind() {
+	case reflect.Struct:
+		def := jsonschema.Definition{
+			Type:                 jsonschema.Object,
+			Properties:           make(map[string]jsonschema.Definition),
+			AdditionalProperties: false,
+			Required:             []string{},
+		}
+		for i := 0; i < t.NumField(); i++ {
+			field := t.Field(i)
+			if !field.IsExported() {
+				continue
+			}
+			name := field.Tag.Get("json")
+			if name == "" || name == "-" {
+				name = field.Name
+			}
+			def.Properties[name] = a.reflectTypeToSchema(field.Type)
+			def.Required = append(def.Required, name)
+		}
+		return def
+
+	case reflect.Slice, reflect.Array:
+		items := a.reflectTypeToSchema(t.Elem())
+		return jsonschema.Definition{
+			Type:  jsonschema.Array,
+			Items: &items,
+		}
+
+	case reflect.Int, reflect.Int8, reflect.Int16, reflect.Int32, reflect.Int64:
+		return jsonschema.Definition{Type: jsonschema.Integer}
+	case reflect.Float32, reflect.Float64:
+		return jsonschema.Definition{Type: jsonschema.Number}
+	case reflect.Bool:
+		return jsonschema.Definition{Type: jsonschema.Boolean}
+	default:
+		return jsonschema.Definition{Type: jsonschema.String}
+	}
+}
+
+func (a *OpenAIAdapter) GetProviderName() string {
+	return "OpenAI (" + a.model + ")"
+}
@@ -1,7 +0,0 @@
-package gitadapters
-
-import "io"
-
-type Adapter interface {
-	GetDiff(owner, repo string, prID int) (io.Reader, error)
-}
@@ -1,31 +0,0 @@
-package gitadapters
-
-import (
-	"fmt"
-	"io"
-	"net/http"
-	"net/url"
-)
-
-type baseHTTP struct {
-	baseURL     string
-	bearerToken string
-}
-
-func (b *baseHTTP) createRequest(method string, body io.Reader, path ...string) (r *http.Request, err error) {
-	target, err := url.JoinPath(b.baseURL, path...)
-	if err != nil {
-		err = fmt.Errorf("can not parse path: %w", err)
-		return
-	}
-	req, err := http.NewRequest(method, target, body)
-	if err != nil {
-		return nil, err
-	}
-
-	if b.bearerToken != "" {
-		req.Header.Set("Authorization", "Bearer "+b.bearerToken)
-	}
-
-	return req, nil
-}
@@ -0,0 +1,62 @@
+package baseadapter
+
+import (
+	"bytes"
+	"context"
+	"encoding/json"
+	"fmt"
+	"io"
+	"net/http"
+	"net/url"
+)
+
+type Rest struct {
+	baseURL     string
+	bearerToken string
+}
+
+func NewRest(baseURL string, bearerToken string) Rest {
+	return Rest{
+		baseURL:     baseURL,
+		bearerToken: bearerToken,
+	}
+}
+
+const defaultBodyBufferSize = 100
+
+func (b *Rest) CreateRequest(ctx context.Context, method string, body any, path ...string) (r *http.Request, err error) {
+	target, err := url.JoinPath(b.baseURL, path...)
+	if err != nil {
+		err = fmt.Errorf("can not parse path: %w", err)
+		return
+	}
+
+	var bodyReader io.Reader
+	if body != nil {
+		bodyBuff := bytes.NewBuffer(make([]byte, 0, defaultBodyBufferSize))
+
+		err = json.NewEncoder(bodyBuff).Encode(body)
+		if err != nil {
+			return
+		}
+
+		bodyReader = bodyBuff
+	}
+
+	req, err := http.NewRequest(method, target, bodyReader)
+	if err != nil {
+		return nil, err
+	}
+
+	req = req.WithContext(ctx)
+
+	if body != nil {
+		req.Header.Set("Content-Type", "application/json")
+	}
+
+	if b.bearerToken != "" {
+		req.Header.Set("Authorization", "Bearer "+b.bearerToken)
+	}
+
+	return req, nil
+}
@@ -1,38 +0,0 @@
-package gitadapters
-
-import (
-	"fmt"
-	"io"
-	"net/http"
-)
-
-type BitbucketAdapter struct {
-	baseHTTP
-}
-
-func NewBitbucket(baseURL string, bearerToken string) *BitbucketAdapter {
-	return &BitbucketAdapter{
-		baseHTTP{
-			baseURL:     baseURL,
-			bearerToken: bearerToken,
-		},
-	}
-}
-
-func (b *BitbucketAdapter) GetDiff(projectKey, repositorySlug string, pullRequestID int) (diff io.Reader, err error) {
-	r, err := b.createRequest(
-		http.MethodGet,
-		nil,
-		"/rest/api/1.0/projects/", projectKey, "repos", repositorySlug, "pull-requests", fmt.Sprintf("%d.diff", pullRequestID),
-	)
-	if err != nil {
-		return
-	}
-
-	response, err := http.DefaultClient.Do(r)
-	if err != nil {
-		return
-	}
-	diff = response.Body
-	return
-}
@@ -0,0 +1,80 @@
+package bitbucket
+
+import (
+	"context"
+	"net/http"
+	"net/http/httptest"
+	"testing"
+)
+
+func TestBitbucketGetFileContentSuccess(t *testing.T) {
+	const expected = "file content"
+	server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+		// Verify path structure
+		if r.URL.Path != "/rest/api/1.0/projects/owner/repos/repo/raw/path/to/file.go" {
+			t.Fatalf("unexpected URL path: %s", r.URL.Path)
+		}
+		w.WriteHeader(http.StatusOK)
+		w.Write([]byte(expected))
+	}))
+	defer server.Close()
+
+	// Trim trailing slash handling done in NewBitbucket
+	adapter := NewBitbucket(server.URL, "")
+	content, err := adapter.GetFileContent(context.Background(), "owner", "repo", "path/to/file.go", "")
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	if content != expected {
+		t.Fatalf("expected %q, got %q", expected, content)
+	}
+}
+
+func TestBitbucketGetFileContentError(t *testing.T) {
+	server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+		w.WriteHeader(http.StatusNotFound)
+		w.Write([]byte("not found"))
+	}))
+	defer server.Close()
+
+	adapter := NewBitbucket(server.URL, "")
+	_, err := adapter.GetFileContent(context.Background(), "owner", "repo", "missing.go", "")
+	if err == nil {
+		t.Fatalf("expected error for non‑200 response")
+	}
+}
+
+func TestBitbucketGetPRHeadSHASuccess(t *testing.T) {
+	const sha = "deadbeef"
+	server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+		if r.URL.Path != "/rest/api/1.0/projects/owner/repos/repo/pull-requests/42" {
+			t.Fatalf("unexpected URL: %s", r.URL.Path)
+		}
+		w.WriteHeader(http.StatusOK)
+		w.Write([]byte(`{"toRef":{"latestCommit":"` + sha + `"}}`))
+	}))
+	defer server.Close()
+
+	adapter := NewBitbucket(server.URL, "")
+	got, err := adapter.GetPRHeadSHA(context.Background(), "owner", "repo", 42)
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	if got != sha {
+		t.Fatalf("expected sha %s, got %s", sha, got)
+	}
+}
+
+func TestBitbucketGetPRHeadSHAError(t *testing.T) {
+	server := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
+		w.WriteHeader(http.StatusInternalServerError)
+		w.Write([]byte("error"))
+	}))
+	defer server.Close()
+
+	adapter := NewBitbucket(server.URL, "")
+	_, err := adapter.GetPRHeadSHA(context.Background(), "owner", "repo", 1)
+	if err == nil {
+		t.Fatalf("expected error for non‑200 response")
+	}
+}
@@ -0,0 +1,113 @@
+package bitbucket
+
+import (
+	"context"
+	"encoding/json"
+	"fmt"
+	"io"
+	"log/slog"
+	"net/http"
+	"strconv"
+	"strings"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/pierre"
+)
+
+func (b *BitbucketAdapter) GetDiff(ctx context.Context, projectKey, repositorySlug string, pullRequestID int) (diff io.ReadCloser, err error) {
+	slog.Debug("Bitbucket GetDiff start", "project", projectKey, "repo", repositorySlug, "pr", pullRequestID)
+	r, err := b.CreateRequest(
+		ctx,
+		http.MethodGet,
+		nil,
+		"/projects/", projectKey, "repos", repositorySlug, "pull-requests", fmt.Sprintf("%d.diff", pullRequestID),
+	)
+	if err != nil {
+		return
+	}
+
+	response, err := http.DefaultClient.Do(r)
+	if err != nil {
+		return
+	}
+
+	if response.StatusCode != http.StatusOK {
+		sb := &strings.Builder{}
+		io.Copy(sb, response.Body)
+		err = fmt.Errorf("error while fetching bitbucket diff status %d, body %s", response.StatusCode, sb.String())
+	}
+
+	diff = response.Body
+	return
+}
+
+func (b *BitbucketAdapter) GetPR(ctx context.Context, projectKey, repositorySlug string, pullRequestID int) (pr PullRequest, err error) {
+	slog.Debug("Bitbucket GetPR start", "project", projectKey, "repo", repositorySlug, "pr", pullRequestID)
+	r, err := b.CreateRequest(
+		ctx,
+		http.MethodGet,
+		nil,
+		"/projects/", projectKey, "repos", repositorySlug, "pull-requests", strconv.Itoa(pullRequestID),
+	)
+
+	response, err := http.DefaultClient.Do(r)
+	if err != nil {
+		return
+	}
+	defer response.Body.Close() // Add this
+
+	err = json.NewDecoder(response.Body).Decode(&pr)
+	if err != nil {
+		slog.Error("Bitbucket GetPR decode error", "err", err)
+		return
+	}
+	slog.Info("Bitbucket GetPR success", "id", pullRequestID)
+
+	return
+}
+
+func (b *BitbucketAdapter) AddComment(ctx context.Context, owner, repo string, prID int, comment pierre.Comment) (err error) {
+	slog.Debug("Bitbucket AddComment start", "owner", owner, "repo", repo, "pr", prID, "file", comment.File, "line", comment.Line)
+	// pr, err := b.GetPR(ctx, owner, repo, prID)
+	// if err != nil {
+	// 	return
+	// }
+
+	commentDTO := Comment{
+		Content: comment.Message,
+		Anchor: Anchor{
+			Path:     comment.File,
+			Line:     comment.Line,
+			LineType: "ADDED",
+			FileType: "TO",
+			DiffType: "EFFECTIVE",
+			// FromHash: pr.ToRef.LatestCommit,
+			// ToHash:   pr.FromRef.LatestCommit,
+		},
+	}
+
+	r, err := b.CreateRequest(ctx,
+		http.MethodPost,
+		commentDTO,
+		"/projects/", owner, "/repos/", repo, "/pull-requests/", strconv.Itoa(prID), "/comments",
+	)
+	if err != nil {
+		return
+	}
+
+	response, err := http.DefaultClient.Do(r)
+	if err != nil {
+		return err
+	}
+	defer response.Body.Close() // Add this
+
+	if response.StatusCode >= 300 || response.StatusCode < 200 {
+		sb := &strings.Builder{}
+		io.Copy(sb, response.Body)
+		err = fmt.Errorf("error while creating comment staus %d, body %s", response.StatusCode, sb.String())
+		slog.Error("Bitbucket AddComment failed", "status", response.StatusCode, "err", err)
+	} else {
+		slog.Info("Bitbucket AddComment succeeded", "pr", prID)
+	}
+
+	return err
+}
@@ -0,0 +1,44 @@
+package bitbucket
+
+type Anchor struct {
+	Path     string `json:"path"`
+	Line     int    `json:"line"`
+	LineType string `json:"lineType,omitempty"`
+	FileType string `json:"fileType"`
+	FromHash string `json:"fromHash,omitempty"`
+	ToHash   string `json:"toHash,omitempty"`
+	DiffType string `json:"diffType,omitempty"`
+}
+
+type Comment struct {
+	Content string `json:"text"`
+	Anchor  Anchor `json:"anchor"`
+}
+
+type PullRequest struct {
+	ID          int64  `json:"id"`
+	Version     int    `json:"version"`
+	Title       string `json:"title"`
+	State       string `json:"state"`
+	Open        bool   `json:"open"`
+	Closed      bool   `json:"closed"`
+	FromRef     Ref    `json:"fromRef"`
+	ToRef       Ref    `json:"toRef"`
+	Description string `json:"description"`
+}
+
+type Ref struct {
+	ID           string     `json:"id"`
+	DisplayID    string     `json:"displayId"`
+	LatestCommit string     `json:"latestCommit"`
+	Repository   Repository `json:"repository"`
+}
+
+type Repository struct {
+	Slug    string  `json:"slug"`
+	Project Project `json:"project"`
+}
+
+type Project struct {
+	Key string `json:"key"`
+}
@@ -0,0 +1,66 @@
+package bitbucket
+
+import (
+	"context"
+	"fmt"
+	"io"
+	"net/http"
+	"strings"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/gitadapters/baseadapter"
+)
+
+type BitbucketAdapter struct {
+	baseadapter.Rest
+}
+
+func NewBitbucket(baseURL string, bearerToken string) *BitbucketAdapter {
+	baseURL, _ = strings.CutSuffix(baseURL, "/")
+	baseURL += "/rest/api/1.0"
+
+	return &BitbucketAdapter{
+		Rest: baseadapter.NewRest(baseURL, bearerToken),
+	}
+}
+
+// GetFileContent returns the raw file content at the given ref (commit SHA) or HEAD if ref is empty.
+func (b *BitbucketAdapter) GetFileContent(ctx context.Context, projectKey, repositorySlug, path, ref string) (string, error) {
+	// Use the Rest helper to build the base URL, then add the "at" query param if needed.
+	r, err := b.CreateRequest(ctx, http.MethodGet, nil,
+		"/projects/", projectKey, "repos", repositorySlug, "raw", path)
+	if err != nil {
+		return "", err
+	}
+	// Ensure raw file retrieval
+	r.Header.Set("Accept", "application/octet-stream")
+	if ref != "" {
+		q := r.URL.Query()
+		q.Set("at", ref)
+		r.URL.RawQuery = q.Encode()
+	}
+
+	resp, err := http.DefaultClient.Do(r)
+	if err != nil {
+		return "", err
+	}
+	defer resp.Body.Close()
+	if resp.StatusCode != http.StatusOK {
+		sb := &strings.Builder{}
+		io.Copy(sb, resp.Body)
+		return "", fmt.Errorf("error fetching file %s status %d, body %s", path, resp.StatusCode, sb.String())
+	}
+	content, err := io.ReadAll(resp.Body)
+	if err != nil {
+		return "", err
+	}
+	return string(content), nil
+}
+
+// GetPRHeadSHA fetches the PR and returns the SHA of the source (to) branch.
+func (b *BitbucketAdapter) GetPRHeadSHA(ctx context.Context, projectKey, repositorySlug string, pullRequestID int) (string, error) {
+	pr, err := b.GetPR(ctx, projectKey, repositorySlug, pullRequestID)
+	if err != nil {
+		return "", err
+	}
+	return pr.ToRef.LatestCommit, nil
+}
@@ -2,9 +2,14 @@ package gitea

 import (
 	"bytes"
+	"context"
+	"fmt"
 	"io"
+	"log/slog"
+	"net/http"

 	"code.gitea.io/sdk/gitea"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/pierre"
 )

 type Adapter struct {
@@ -21,10 +26,68 @@ func New(baseURL, token string) (*Adapter, error) {
 	}, nil
 }

-func (g *Adapter) GetDiff(owner, repo string, prID int) (io.Reader, error) {
+func (g *Adapter) GetDiff(ctx context.Context, owner, repo string, prID int) (io.ReadCloser, error) {
+	slog.Debug("Gitea GetDiff start", "owner", owner, "repo", repo, "pr", prID)
+	g.client.SetContext(ctx)
 	diff, _, err := g.client.GetPullRequestDiff(owner, repo, int64(prID), gitea.PullRequestDiffOptions{})
 	if err != nil {
 		return nil, err
 	}
-	return bytes.NewReader(diff), nil
+	rc := io.NopCloser(bytes.NewReader(diff))
+	slog.Info("Gitea GetDiff success", "owner", owner, "repo", repo, "pr", prID, "bytes", len(diff))
+	return rc, nil
+}
+
+func (g *Adapter) AddComment(ctx context.Context, owner, repo string, prID int, comment pierre.Comment) error {
+	slog.Debug("Gitea AddComment start", "owner", owner, "repo", repo, "pr", prID, "file", comment.File, "line", comment.Line)
+	g.client.SetContext(ctx)
+	opts := gitea.CreatePullReviewOptions{
+		State: gitea.ReviewStateComment,
+		Comments: []gitea.CreatePullReviewComment{
+			{
+				Path:       comment.File,
+				Body:       comment.Message,
+				NewLineNum: int64(comment.Line),
+			},
+		},
+	}
+	_, resp, err := g.client.CreatePullReview(owner, repo, int64(prID), opts)
+	if err != nil {
+		slog.Error("Gitea AddComment failed", "err", err)
+		return err
+	}
+	if resp != nil && resp.StatusCode != http.StatusCreated {
+		return fmt.Errorf("unexpected status %d creating comment", resp.StatusCode)
+	}
+	slog.Info("Gitea AddComment succeeded", "pr", prID)
+	return nil
+}
+
+// GetFileContent returns the file content at a given path and ref (commit SHA).
+func (g *Adapter) GetFileContent(ctx context.Context, owner, repo, path, ref string) (string, error) {
+	slog.Debug("Gitea GetFileContent start", "owner", owner, "repo", repo, "path", path, "ref", ref)
+	g.client.SetContext(ctx)
+	// The SDK's GetFile returns the raw bytes of the file.
+	data, _, err := g.client.GetFile(owner, repo, ref, path)
+	if err != nil {
+		return "", err
+	}
+	slog.Info("Gitea GetFileContent success", "bytes", len(data))
+	return string(data), nil
+}
+
+// GetPRHeadSHA fetches the pull request and returns the head commit SHA.
+func (g *Adapter) GetPRHeadSHA(ctx context.Context, owner, repo string, prID int) (string, error) {
+	slog.Debug("Gitea GetPRHeadSHA start", "owner", owner, "repo", repo, "pr", prID)
+	g.client.SetContext(ctx)
+	// GetPullRequest returns the detailed PR information.
+	pr, _, err := g.client.GetPullRequest(owner, repo, int64(prID))
+	if err != nil {
+		return "", err
+	}
+	if pr == nil || pr.Head == nil {
+		return "", fmt.Errorf("pull request %d has no head information", prID)
+	}
+	slog.Info("Gitea GetPRHeadSHA success", "sha", pr.Head.Sha)
+	return pr.Head.Sha, nil
 }
@@ -1 +0,0 @@
-package gitadapters
@@ -0,0 +1,70 @@
+package pierre
+
+import (
+	"fmt"
+	"strings"
+	"testing"
+)
+
+func TestParseGuidelinesFromStringValid(t *testing.T) {
+	md := "# Rule One\n\n  - Item A  \n\n# Rule Two\n"
+	lines, err := parseGuidelinesFromString(md)
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	expected := []string{"# Rule One", "- Item A", "# Rule Two"}
+	if got, want := fmt.Sprint(lines), fmt.Sprint(expected); got != want {
+		t.Fatalf("expected %v, got %v", expected, lines)
+	}
+}
+
+func TestParseGuidelinesFromStringEmpty(t *testing.T) {
+	md := "\n   \n"
+	lines, err := parseGuidelinesFromString(md)
+	if err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	if len(lines) != 0 {
+		t.Fatalf("expected empty slice, got %d elements", len(lines))
+	}
+}
+
+func TestParseGuidelinesFromStringTooManyLines(t *testing.T) {
+	// generate 1001 non-empty lines
+	var sb strings.Builder
+	for i := 0; i < 1001; i++ {
+		sb.WriteString(fmt.Sprintf("Line %d\n", i))
+	}
+	_, err := parseGuidelinesFromString(sb.String())
+	if err == nil {
+		t.Fatalf("expected error for exceeding line limit, got nil")
+	}
+}
+
+func TestWithGuidelinesSuccess(t *testing.T) {
+	svc := &Service{}
+	md := "First line\nSecond line\n"
+	if err := svc.WithGuidelines(md); err != nil {
+		t.Fatalf("unexpected error: %v", err)
+	}
+	expected := []string{"First line", "Second line"}
+	if got, want := fmt.Sprint(svc.guidelines), fmt.Sprint(expected); got != want {
+		t.Fatalf("expected guidelines %v, got %v", expected, svc.guidelines)
+	}
+}
+
+func TestWithGuidelinesError(t *testing.T) {
+	svc := &Service{guidelines: []string{"old"}}
+	var sb strings.Builder
+	for i := 0; i < 1001; i++ {
+		sb.WriteString("x\n")
+	}
+	err := svc.WithGuidelines(sb.String())
+	if err == nil {
+		t.Fatalf("expected error, got nil")
+	}
+	// ensure old guidelines unchanged
+	if len(svc.guidelines) != 1 || svc.guidelines[0] != "old" {
+		t.Fatalf("guidelines should remain unchanged on error")
+	}
+}
@@ -4,36 +4,156 @@ import (
 	"context"
 	"fmt"
 	"io"
+	"log/slog"
+	"strings"

-	"bitbucket.bit.admin.ch/scm/~u80859501/pierre-bot/internal/chatter"
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
 )

+// DefaultChunkSize is the fallback maximum size (in bytes) for a diff chunk when no explicit value is configured.
+const DefaultChunkSize = 60000
+
 type Comment struct {
 	File    string `json:"file"`
 	Line    int    `json:"line"`
 	Message string `json:"message"`
 }

-func JudgePR(ctx context.Context, chat chatter.ChatAdapter, diff io.Reader) (comments []Comment, err error) {
+func (s *Service) judgePR(ctx context.Context, diff io.Reader) (comments []Comment, err error) {
+	slog.Info("judgePR started")
 	diffBytes, err := io.ReadAll(diff)
 	if err != nil {
 		return nil, fmt.Errorf("failed to read diff: %w", err)
 	}
-	err = chat.GenerateStructured(ctx, []chatter.Message{
-		{
-			Role: chatter.RoleSystem,
-			Content: `
-				You are a very strict senior software architect.
-				You review **only** newly added or modified lines in a unified diff (lines prefixed with “+”), together with the immediate hunk context.
-				You do **not** report issues that appear **solely** in deleted lines (“-”) or that have already been fixed by the change.
-				No comments are made on pure formatting/whitespace changes or reordering that does not alter the program’s behavior.
-			`,
-		},
-		{
-			Role:    chatter.RoleUser,
-			Content: fmt.Sprintf("Hello please review my PR.\n Here is the git diff of it: %s", string(diffBytes)),
-		},
-	}, &comments)

+	// Determine chunk size (use default if not set)
+	maxSize := s.maxChunkSize
+	if maxSize <= 0 {
+		maxSize = DefaultChunkSize // default 60KB ~ 15k tokens
+	}
+
+	chunks := splitDiffIntoChunks(diffBytes, maxSize)
+	allComments := []Comment{}
+
+	// Build optional guidelines text (added as a separate section with a clear delimiter)
+	guidelinesText := ""
+	if len(s.guidelines) > 0 {
+		// Two newlines ensure the guidelines start on a fresh paragraph.
+		guidelinesText = "\n\nProject guidelines:\n"
+		for _, g := range s.guidelines {
+			guidelinesText += "- " + g + "\n"
+		}
+	}
+
+	// System prompt that instructs the LLM precisely.
+	baseSystem := strings.TrimSpace(`
+You are a strict senior software architect.
+Only comment on newly added or modified lines in the diff; ignore deletions, pure formatting, or re‑ordering that does not change behavior.
+For each issue output a JSON object with fields "file", "line", and "message" (message should be concise, ≤2 sentences, and actionable).
+If project guidelines are provided, treat them as hard rules that must be respected.`) + guidelinesText
+
+	for i, chunk := range chunks {
+		// Include the chunk identifier in the system message only if there are multiple chunks.
+		systemContent := baseSystem
+		if len(chunks) > 1 {
+			systemContent = fmt.Sprintf("%s\nChunk %d of %d.", baseSystem, i+1, len(chunks))
+		}
+		userContent := chunk
+
+		var chunkComments []Comment
+		err = s.chat.GenerateStructured(ctx, []chatter.Message{{
+			Role:    chatter.RoleSystem,
+			Content: systemContent,
+		}, {
+			Role:    chatter.RoleUser,
+			Content: userContent,
+		}}, &chunkComments)
+		if err != nil {
+			return nil, err
+		}
+		allComments = append(allComments, chunkComments...)
+	}
+
+	// De‑duplicate comments (keyed by file:line)
+	unique := make(map[string]Comment)
+	for _, c := range allComments {
+		key := fmt.Sprintf("%s:%d", c.File, c.Line)
+		unique[key] = c
+	}
+	for _, v := range unique {
+		comments = append(comments, v)
+	}
+	slog.Info("judgePR finished", "comments", len(comments))
 	return
 }
+
+// splitDiffIntoChunks splits a diff into chunks that do not exceed maxSize bytes.
+// It tries to split on file boundaries ("diff --git") first, then on hunk boundaries (@@),
+// and finally on a hard byte limit.
+func splitDiffIntoChunks(diff []byte, maxSize int) []string {
+	// Preserve the file header for each chunk when a single file's diff is split across multiple chunks.
+	// The header is the portion before the first hunk marker "@@" (including the "diff --git" line).
+	// When we need to split by hunks, we prepend this header to every resulting sub‑chunk.
+
+	if len(diff) <= maxSize {
+		return []string{string(diff)}
+	}
+	content := string(diff)
+	// Split by file headers
+	parts := strings.Split(content, "\ndiff --git ")
+	chunks := []string{}
+	var current strings.Builder
+	for idx, part := range parts {
+		seg := part
+		if idx != 0 {
+			// Preserve the leading newline that was removed by Split
+			seg = "\n" + "diff --git " + part
+		}
+		if current.Len()+len(seg) > maxSize && current.Len() > 0 {
+			chunks = append(chunks, current.String())
+			current.Reset()
+		}
+		if len(seg) > maxSize {
+			// Determine if there is a hunk marker. If not, fall back to simple size‑based chunking.
+			headerEnd := strings.Index(seg, "\n@@ ")
+			if headerEnd == -1 {
+				// No hunk marker – split purely by size.
+				remaining := seg
+				for len(remaining) > maxSize {
+					chunks = append(chunks, remaining[:maxSize])
+					remaining = remaining[maxSize:]
+				}
+				current.WriteString(remaining)
+				continue
+			}
+			// Preserve the header up to the first hunk.
+			header := seg[:headerEnd+1] // include newline before "@@"
+			// Split the rest of the segment into hunks (excluding the header part).
+			hunks := strings.Split(strings.TrimPrefix(seg, header), "\n@@ ")
+			for _, h := range hunks {
+				// Reconstruct each hunk with its header and "@@ " prefix.
+				hseg := header + "@@ " + h
+				if current.Len()+len(hseg) > maxSize && current.Len() > 0 {
+					chunks = append(chunks, current.String())
+					current.Reset()
+				}
+				if len(hseg) > maxSize {
+					// If a single hunk exceeds maxSize, split it further.
+					for len(hseg) > maxSize {
+						chunks = append(chunks, hseg[:maxSize])
+						hseg = hseg[maxSize:]
+					}
+					current.WriteString(hseg)
+				} else {
+					current.WriteString(hseg)
+				}
+			}
+		} else {
+			current.WriteString(seg)
+		}
+	}
+	if current.Len() > 0 {
+		chunks = append(chunks, current.String())
+	}
+	return chunks
+}
@@ -0,0 +1,141 @@
+package pierre
+
+import (
+	"bytes"
+	"context"
+	"io"
+	"strings"
+	"testing"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
+	"github.com/google/go-cmp/cmp"
+)
+
+// mockChat implements the ChatAdapter interface for testing.
+type mockChat struct{ callCount int }
+
+func (m *mockChat) GenerateStructured(ctx context.Context, msgs []chatter.Message, target interface{}) error {
+	m.callCount++
+	if cSlice, ok := target.(*[]Comment); ok {
+		*cSlice = []Comment{{File: "file.go", Line: 1, Message: "test comment"}}
+		return nil
+	}
+	return nil
+}
+
+func (m *mockChat) GetProviderName() string { return "mock" }
+
+// mockGit implements the GitAdapter interface for testing.
+type mockGit struct{}
+
+func (g *mockGit) GetDiff(ctx context.Context, owner, repo string, prID int) (io.ReadCloser, error) {
+	diff := "diff --git a/file1.go b/file1.go\n+line1\n" + "diff --git a/file2.go b/file2.go\n+line2\n"
+	return io.NopCloser(bytes.NewReader([]byte(diff))), nil
+}
+
+func (g *mockGit) AddComment(ctx context.Context, owner, repo string, prID int, comment Comment) error {
+	return nil
+}
+
+func (g *mockGit) GetFileContent(ctx context.Context, owner, repo, path, ref string) (string, error) {
+	// For tests, return a simple placeholder content.
+	return "package main\n\nfunc placeholder() {}", nil
+}
+
+func (g *mockGit) GetPRHeadSHA(ctx context.Context, owner, repo string, prID int) (string, error) {
+	return "dummysha", nil
+}
+
+func TestSplitDiffIntoChunks(t *testing.T) {
+	cases := []struct {
+		name           string
+		diff           string
+		maxSize        int
+		wantChunks     int // 0 means we don't assert exact count
+		wantPrefixes   []string
+		checkRecombine bool
+	}{
+		{
+			name:           "small diff",
+			diff:           "diff --git a/file1.txt b/file1.txt\n+added line\n",
+			maxSize:        1000,
+			wantChunks:     1,
+			wantPrefixes:   []string{"diff --git a/file1.txt"},
+			checkRecombine: true,
+		},
+		{
+			name: "multiple files",
+			diff: "diff --git a/file1.txt b/file1.txt\n+added line 1\n" +
+				"diff --git a/file2.txt b/file2.txt\n+added line 2\n",
+			maxSize:        50,
+			wantChunks:     2,
+			wantPrefixes:   []string{"diff --git a/file1.txt", "diff --git a/file2.txt"},
+			checkRecombine: false,
+		},
+		{
+			name: "large single file",
+			diff: func() string {
+				line := "+very long added line that will be repeated many times to exceed the chunk size\n"
+				return "diff --git a/large.txt b/large.txt\n" + strings.Repeat(line, 200)
+			}(),
+			maxSize:        500,
+			wantChunks:     0,
+			wantPrefixes:   nil,
+			checkRecombine: true,
+		},
+	}
+
+	for _, tc := range cases {
+		t.Run(tc.name, func(t *testing.T) {
+			chunks := splitDiffIntoChunks([]byte(tc.diff), tc.maxSize)
+			if tc.wantChunks > 0 && len(chunks) != tc.wantChunks {
+				t.Fatalf("expected %d chunks, got %d", tc.wantChunks, len(chunks))
+			}
+			for i, prefix := range tc.wantPrefixes {
+				if i >= len(chunks) {
+					t.Fatalf("missing chunk %d for prefix check", i)
+				}
+				trimmed := strings.TrimPrefix(chunks[i], "\n")
+				if !strings.HasPrefix(trimmed, prefix) {
+					t.Fatalf("chunk %d does not start with expected prefix %q: %s", i, prefix, chunks[i])
+				}
+			}
+			for i, c := range chunks {
+				if tc.maxSize > 0 && len(c) > tc.maxSize {
+					t.Fatalf("chunk %d exceeds max size %d: %d", i, tc.maxSize, len(c))
+				}
+			}
+			if tc.checkRecombine {
+				recombined := strings.Join(chunks, "")
+				if diff := cmp.Diff(tc.diff, recombined); diff != "" {
+					t.Fatalf("recombined diff differs:\n%s", diff)
+				}
+			}
+		})
+	}
+}
+
+func TestJudgePR_ChunkAggregationAndDeduplication(t *testing.T) {
+	chatMock := &mockChat{}
+	svc := &Service{
+		maxChunkSize: 50,
+		guidelines:   nil,
+		git:          &mockGit{},
+		chat:         chatMock,
+	}
+	diffReader, err := svc.git.GetDiff(context.Background(), "", "", 0)
+	if err != nil {
+		t.Fatalf("failed to get diff: %v", err)
+	}
+	defer diffReader.Close()
+	comments, err := svc.judgePR(context.Background(), diffReader)
+	if err != nil {
+		t.Fatalf("judgePR error: %v", err)
+	}
+	if got, want := len(comments), 1; got != want {
+		t.Fatalf("expected %d comment after deduplication, got %d", want, got)
+	}
+	if chatMock.callCount != 2 {
+		t.Fatalf("expected mockChat to be called for each chunk (2), got %d", chatMock.callCount)
+	}
+}
@@ -0,0 +1,44 @@
+package pierre
+
+import (
+	"context"
+	"testing"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
+)
+
+type overlapChat struct{ callCount int }
+
+func (m *overlapChat) GenerateStructured(ctx context.Context, msgs []chatter.Message, target interface{}) error {
+	m.callCount++
+	if cSlice, ok := target.(*[]Comment); ok {
+		// Return two comments with same file and line to test deduplication
+		*cSlice = []Comment{{File: "dup.go", Line: 10, Message: "first"}, {File: "dup.go", Line: 10, Message: "second"}}
+		return nil
+	}
+	return nil
+}
+
+func (m *overlapChat) GetProviderName() string { return "mock" }
+
+func TestJudgePR_DeduplicationOverlap(t *testing.T) {
+	chat := &overlapChat{}
+	svc := &Service{
+		maxChunkSize: 1000,
+		guidelines:   nil,
+		git:          &mockGit{},
+		chat:         chat,
+	}
+	diffReader, err := svc.git.GetDiff(context.Background(), "", "", 0)
+	if err != nil {
+		t.Fatalf("failed to get diff: %v", err)
+	}
+	defer diffReader.Close()
+	comments, err := svc.judgePR(context.Background(), diffReader)
+	if err != nil {
+		t.Fatalf("judgePR error: %v", err)
+	}
+	if len(comments) != 1 {
+		t.Fatalf("expected 1 deduplicated comment, got %d", len(comments))
+	}
+}
@@ -0,0 +1,86 @@
+package pierre
+
+import (
+	"context"
+	"fmt"
+	"io"
+	"strings"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
+)
+
+// Service holds the core collaborators and configuration for Pierre.
+// The order of the fields is intentional: configuration fields first (used
+// during initialization) followed by the adapters. This prevents accidental
+// changes to the serialized layout if encoding/gob or encoding/json is used
+// elsewhere in the future.
+type Service struct {
+	maxChunkSize    int
+	guidelines      []string // stored as slice of lines; legacy, see WithGuidelines
+	skipSanityCheck bool     // if true, skip LLM sanity‑check prompts per comment
+	disableComments bool
+	git             GitAdapter
+	chat            ChatAdapter
+}
+
+func New(chat ChatAdapter, git GitAdapter, maxChunkSize int, guidelines []string, disableComments bool) *Service {
+	// Existing constructor retains slice based guidelines for backward compatibility.
+	return &Service{
+		git:             git,
+		chat:            chat,
+		maxChunkSize:    maxChunkSize,
+		guidelines:      guidelines,
+		skipSanityCheck: false,
+		disableComments: disableComments,
+	}
+}
+
+// WithGuidelines parses a raw Markdown string (or any multiline string) into
+// individual guideline lines, validates the line‑count (max 1000 non‑empty lines),
+// and stores the result in the Service. It returns an error if validation fails.
+// This is a convenience mutator for callers that have the guidelines as a
+// single string.
+func (s *Service) WithGuidelines(md string) error {
+	lines, err := parseGuidelinesFromString(md)
+	if err != nil {
+		return err
+	}
+	s.guidelines = lines
+	return nil
+}
+
+// parseGuidelinesFromString splits a markdown string into trimmed, non‑empty
+// lines and ensures the total number of lines does not exceed 1000.
+func (s *Service) SetSanityCheck(enabled bool) {
+	s.skipSanityCheck = !enabled
+}
+
+// parseGuidelinesFromString splits a markdown string into trimmed, non‑empty
+func parseGuidelinesFromString(md string) ([]string, error) {
+	var result []string
+	// Split on newline. Handles both \n and \r\n because TrimSpace removes \r.
+	rawLines := strings.Split(md, "\n")
+	for _, l := range rawLines {
+		trimmed := strings.TrimSpace(l)
+		if trimmed == "" {
+			continue
+		}
+		result = append(result, trimmed)
+	}
+	if len(result) > 1000 {
+		return nil, fmt.Errorf("guidelines exceed 1000 lines (found %d)", len(result))
+	}
+	return result, nil
+}
+
+type GitAdapter interface {
+	GetDiff(ctx context.Context, owner, repo string, prID int) (io.ReadCloser, error)
+	AddComment(ctx context.Context, owner, repo string, prID int, comment Comment) error
+	GetFileContent(ctx context.Context, owner, repo, path, ref string) (string, error)
+	GetPRHeadSHA(ctx context.Context, owner, repo string, prID int) (string, error)
+}
+
+type ChatAdapter interface {
+	GenerateStructured(ctx context.Context, messages []chatter.Message, target interface{}) error
+	GetProviderName() string
+}
@@ -0,0 +1,84 @@
+package pierre
+
+import (
+	"context"
+	"fmt"
+	"log/slog"
+
+	"git.schreifuchs.ch/schreifuchs/pierre-bot/internal/chatter"
+)
+
+func (s *Service) MakeReview(ctx context.Context, organisation string, repo string, prID int) error {
+	// Fetch Diff using positional args from shared RepoArgs
+	diff, err := s.git.GetDiff(ctx, organisation, repo, prID)
+	defer diff.Close()
+	if err != nil {
+		return fmt.Errorf("error fetching diff: %w", err)
+	}
+
+	// Run Logic
+	comments, err := s.judgePR(ctx, diff)
+	if err != nil {
+		return fmt.Errorf("error judging PR: %w", err)
+	}
+
+	// ---------- Sanity‑check step ----------
+	headSHA, err := s.git.GetPRHeadSHA(ctx, organisation, repo, prID)
+	if err != nil {
+		slog.Warn("could not fetch PR head SHA", "error", err)
+	} else if !s.skipSanityCheck {
+		filtered := []Comment{}
+		for _, c := range comments {
+			// Retrieve full file content at the PR head
+			fileContent, fErr := s.git.GetFileContent(ctx, organisation, repo, c.File, headSHA)
+			if fErr != nil {
+				slog.Warn("failed to fetch file", "path", c.File, "error", fErr)
+				filtered = append(filtered, c)
+				continue
+			}
+
+			// Build a simple sanity‑check prompt
+			systemPrompt := `You are a senior software architect. Given the full source code of a file and a review comment that refers to it, decide whether the comment is useful. Return JSON with fields "useful" (bool) and "reason" (short explanation, ≤2 sentences).`
+			userPrompt := fmt.Sprintf("File content:\n%s\n\nComment:\n%s", fileContent, c.Message)
+
+			type sanityResult struct {
+				Useful bool   `json:"useful"`
+				Reason string `json:"reason"`
+			}
+			var res sanityResult
+			if err := s.chat.GenerateStructured(ctx, []chatter.Message{{Role: chatter.RoleSystem, Content: systemPrompt}, {Role: chatter.RoleUser, Content: userPrompt}}, &res); err != nil {
+				slog.Error("sanity check error", "file", c.File, "line", c.Line, "error", err)
+				filtered = append(filtered, c)
+				continue
+			}
+			if res.Useful {
+				// Optionally annotate the comment with the reason for debugging
+				c.Message = fmt.Sprintf("%s (Reason: %s)", c.Message, res.Reason)
+				filtered = append(filtered, c)
+			} else {
+				slog.Info("comment discarded", "file", c.File, "line", c.Line, "reason", res.Reason)
+			}
+		}
+		comments = filtered
+	}
+
+	fmt.Printf("Analysis complete. Found %d issues.\n---\n", len(comments))
+
+	model := s.chat.GetProviderName()
+
+	for _, c := range comments {
+		c.Message = fmt.Sprintf("%s (Generated by: %s)", c.Message, model)
+
+		// Normal mode: print to stdout and post the comment to the VCS.
+		fmt.Printf("File: %s\nLine: %d\nMessage: %s\n%s\n",
+			c.File, c.Line, c.Message, "---")
+
+		if !s.disableComments {
+			if err := s.git.AddComment(ctx, organisation, repo, prID, c); err != nil {
+				slog.Error("failed to add comment", "error", err)
+			}
+		}
+	}
+
+	return nil
+}
Author	SHA1	Message	Date
u80864958	75ef8da1a4	feat: guidelines	2026-02-13 18:33:21 +01:00
u80864958	2e12c39786	feat: add logs	2026-02-13 18:04:09 +01:00
u80864958	343f6ab165	feat(pierre): sanity check	2026-02-13 17:35:39 +01:00
u80864958	cc321be658	feat(pierre): add diff chunking and configurable review settings	2026-02-13 17:09:18 +01:00
u80864958	2cb64194b9	feat: correctly implement bitbucket & add OpenAIAdapter	2026-02-13 14:31:20 +01:00
schreifuchs	b67125024c	Merge pull request 'feat: gitea client' (#1 ) from feat/gitea into main Reviewed-on: #1	2026-02-13 09:30:02 +01:00
schreifuchs	9d49d94eff	feat: gitea client	2026-02-12 22:05:38 +01:00