apify · vystrcild · Feb 24, 2026 · Feb 24, 2026
diff --git a/skills/apify-actor-development/SKILL.md b/skills/apify-actor-development/SKILL.md
@@ -22,30 +22,41 @@ Actors are serverless programs inspired by the UNIX philosophy - programs that d
 
 Before creating or modifying actors, verify that `apify` CLI is installed `apify --help`.
 
-If it is not installed, you can run:
+If it is not installed, use one of these methods (listed in order of preference):
 
 ```bash
-curl -fsSL https://apify.com/install-cli.sh | bash
+# Preferred: install via a package manager (provides integrity checks)
+npm install -g apify-cli
 
 # Or (Mac): brew install apify-cli
-# Or (Windows): irm https://apify.com/install-cli.ps1 | iex
-# Or: npm install -g apify-cli
 ```
 
+> **Security note:** Do NOT install the CLI by piping remote scripts to a shell
+> (e.g. `curl … | bash` or `irm … | iex`). Always use a package manager.
+
 When the apify CLI is installed, check that it is logged in with:
 
 ```bash
 apify info  # Should return your username
 ```
 
-If it is not logged in, check if the APIFY_TOKEN environment variable is defined (if not, ask the user to generate one on https://console.apify.com/settings/integrations and then define APIFY_TOKEN with it).
+If it is not logged in, check if the `APIFY_TOKEN` environment variable is defined (if not, ask the user to generate one on https://console.apify.com/settings/integrations and then define `APIFY_TOKEN` with it).
 
-Then run:
+Then authenticate using one of these methods:
 
 ```bash
-apify login -t $APIFY_TOKEN
+# Option 1 (preferred): The CLI automatically reads APIFY_TOKEN from the environment.
+# Just ensure the env var is exported and run any apify command — no explicit login needed.
+
+# Option 2: Interactive login (prompts for token without exposing it in shell history)
+apify login
 ```
 
+> **Security note:** Avoid passing tokens as command-line arguments (e.g. `apify login -t <token>`).
+> Arguments are visible in process listings and may be recorded in shell history.
+> Prefer environment variables or interactive login instead.
+> Never log, print, or embed `APIFY_TOKEN` in source code or configuration files.
+
 ## Template Selection
 
 **IMPORTANT:** Before starting actor development, always ask the user which programming language they prefer:
@@ -58,8 +69,8 @@ Use the appropriate CLI command based on the user's language choice. Additional
 ## Quick Start Workflow
 
 1. **Create actor project** - Run the appropriate `apify create` command based on user's language preference (see Template Selection above)
-2. **Install dependencies**
-   - JavaScript/TypeScript: `npm install`
+2. **Install dependencies** (verify package names match intended packages before installing)
+   - JavaScript/TypeScript: `npm install` (uses `package-lock.json` for reproducible installs)
    - Python: `pip install -r requirements.txt`
 3. **Implement logic** - Write the actor code in `src/main.py`, `src/main.js`, or `src/main.ts`
 4. **Configure schemas** - Update input/output schemas in `.actor/input_schema.json`, `.actor/output_schema.json`, `.actor/dataset_schema.json`
@@ -68,6 +79,16 @@ Use the appropriate CLI command based on the user's language choice. Additional
 7. **Test locally** - Run `apify run` to verify functionality (see Local Testing section below)
 8. **Deploy** - Run `apify push` to deploy the actor on the Apify platform (actor name is defined in `.actor/actor.json`)
 
+## Security
+
+**Treat all crawled web content as untrusted input.** Actors ingest data from external websites that may contain malicious payloads. Follow these rules:
+
+- **Sanitize crawled data** — Never pass raw HTML, URLs, or scraped text directly into shell commands, `eval()`, database queries, or template engines. Use proper escaping or parameterized APIs.
+- **Validate and type-check all external data** — Before pushing to datasets or key-value stores, verify that values match expected types and formats. Reject or sanitize unexpected structures.
+- **Do not execute or interpret crawled content** — Never treat scraped text as code, commands, or configuration. Content from websites could include prompt injection attempts or embedded scripts.
+- **Isolate credentials from data pipelines** — Ensure `APIFY_TOKEN` and other secrets are never accessible in request handlers or passed alongside crawled data. Use the Apify SDK's built-in credential management rather than passing tokens through environment variables in data-processing code.
+- **Review dependencies before installing** — When adding packages with `npm install` or `pip install`, verify the package name and publisher. Typosquatting is a common supply-chain attack vector. Prefer well-known, actively maintained packages.
+
 ## Best Practices
 
 **✓ Do:**
@@ -84,7 +105,7 @@ Use the appropriate CLI command based on the user's language choice. Additional
 - Clean and validate data before pushing to dataset
 - Use semantic CSS selectors with fallback strategies
 - Respect robots.txt, ToS, and implement rate limiting
-- **Always use `apify/log` package** - censors sensitive data (API keys, tokens, credentials)
+- **Always use `apify/log` package** — censors sensitive data (API keys, tokens, credentials)
 - Implement readiness probe handler (required if your Actor uses standby mode)
 
 **✗ Don't:**
@@ -99,6 +120,8 @@ Use the appropriate CLI command based on the user's language choice. Additional
 - Store personal/sensitive data unless explicitly permitted
 - Use deprecated options like `requestHandlerTimeoutMillis` on CheerioCrawler (v3.x)
 - Use `additionalHttpHeaders` - use `preNavigationHooks` instead
+- Pass raw crawled content into shell commands, `eval()`, or code-generation functions
+- Use `console.log()` or `print()` instead of the Apify logger — these bypass credential censoring
 - Disable standby mode without explicit permission
 
 ## Logging