feat: add workflow and script to check edit links on docs #3557

anshgoyalevil · 2025-01-08T15:09:14Z

Description

This PR adds a script that programmatically generates the edit URLs for the docs and fetches their HEAD to check that each edit link on the AsyncAPI website exists.
The workflow runs once a week at midnight.
If there are any errors, the workflow posts them to Slack. Example message: https://asyncapi.slack.com/archives/C0884HHQZS5/p1736348664768859
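
A minimal sketch of the check described above, assuming a fetch-compatible function is passed in. `checkEditLink`, `fetchFn`, and the entry shape are hypothetical names for illustration, not necessarily the script's actual API. It sends a HEAD request and treats a 404 status as a broken edit link.

```javascript
// Hypothetical helper: returns the entry when its edit link responds with 404, otherwise null.
// `fetchFn` is any fetch-compatible function (an assumption; the PR itself uses node-fetch-2).
async function checkEditLink(entry, fetchFn) {
  const response = await fetchFn(entry.editLink, { method: 'HEAD' });
  return response.status === 404 ? entry : null;
}

// Stubbed fetch so the sketch runs without network access.
const stubFetch = async (url) => ({ status: url.endsWith('missing.md') ? 404 : 200 });

checkEditLink({ filePath: 'docs/a.md', editLink: 'https://example.com/a.md' }, stubFetch)
  .then((result) => console.log(result)); // null, because the stub answers 200
```

Passing the fetch function in keeps the check testable with a stub, which is also how the test suite in this PR appears to approach it (by mocking fetch).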

Related issue(s)
fixes #3221

Summary by CodeRabbit

Release Notes

New Features
- Added a weekly automated link checker to verify documentation links.
- Introduced a new script to check and validate edit links in markdown files.
Improvements
- Enhanced edit link generation for documentation pages.
- Updated documentation link configurations.
Testing
- Added a comprehensive test suite for link checking functionality.
- Improved test coverage for URL validation processes.
Chores
- Updated GitHub Actions workflow.
- Minor configuration file adjustments.

coderabbitai · 2025-01-08T15:09:23Z

Walkthrough

A comprehensive enhancement to the documentation link checking and editing workflow has been implemented. The changes introduce a new GitHub Actions workflow for weekly link validation, modify the edit link generation logic in the DocsLayout component, update configuration files, and add a new script for checking edit links. The implementation includes robust error handling, path resolution, and URL validation mechanisms to ensure the accuracy of documentation edit links across the repository.

Changes

File	Change Summary
`.github/workflows/check-edit-links.yml`	New workflow added for weekly documentation link checking
`components/layout/DocsLayout.tsx`	Modified `generateEditLink` function to handle `.mdx` to `.md` link conversion
`config/edit-page-config.json`	Updated GitHub documentation URL path
`package.json`	Added new script `test:editlinks` for link checking
`jest.config.js`	Minor formatting change
`tests/fixtures/markdown/check-editlinks-data.js`	Added test data constants for link checking
`scripts/dashboard/build-dashboard.js`	Exported `pause` function
`scripts/markdown/check-edit-links.js`	New script for comprehensive edit link validation
`tests/markdown/check-edit-links.test.js`	Added comprehensive test suite for link checking functionality

Assessment against linked issues

Objective	Addressed	Explanation
Fix incorrect "Edit this page on GitHub" links	✅
Validate documentation edit links	✅

Poem

🐰 Hop, hop, through markdown's maze,
Links checked with algorithmic praise,
No broken paths shall make us fret,
Our docs now shine, a perfect set!
Rabbit's code, a linking delight! 🔗


netlify · 2025-01-08T15:09:33Z

✅ Deploy Preview for asyncapi-website ready!

Built without sensitive environment variables

Name	Link
🔨 Latest commit	`22a31ce`
🔍 Latest deploy log	https://app.netlify.com/sites/asyncapi-website/deploys/678692f40c88d90008103fd5
😎 Deploy Preview	https://deploy-preview-3557--asyncapi-website.netlify.app

asyncapi-bot · 2025-01-08T15:13:12Z

⚡️ Lighthouse report for the changes in this PR:

Category	Score
🔴 Performance	38
🟢 Accessibility	98
🟢 Best practices	92
🟢 SEO	100
🔴 PWA	33

Lighthouse ran on https://deploy-preview-3557--asyncapi-website.netlify.app/

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (2)

.github/workflows/check-edit-links.yml (2)
32-37: Improve shell script robustness.

The error extraction step could be more robust by following shell scripting best practices.
 - name: Extract 404 URLs from output
   id: extract-404
   run: |
-    ERRORS=$(sed -n '/URLs returning 404:/,$p' output.log)
-    echo "errors<<EOF" >> $GITHUB_OUTPUT
-    echo "$ERRORS" >> $GITHUB_OUTPUT
-    echo "EOF" >> $GITHUB_OUTPUT
+    {
+      ERRORS="$(sed -n '/URLs returning 404:/,$p' output.log)"
+      echo "errors<<EOF"
+      echo "${ERRORS}"
+      echo "EOF"
+    } >> "$GITHUB_OUTPUT"
🧰 Tools

🪛 actionlint (1.7.4)

32-32: shellcheck reported issue in this script: SC2129:style:2:1: Consider using { cmd1; cmd2; } >> file instead of individual redirects

(shellcheck)

32-32: shellcheck reported issue in this script: SC2086:info:2:23: Double quote to prevent globbing and word splitting

(shellcheck)

32-32: shellcheck reported issue in this script: SC2086:info:3:19: Double quote to prevent globbing and word splitting

(shellcheck)

32-32: shellcheck reported issue in this script: SC2086:info:4:15: Double quote to prevent globbing and word splitting

(shellcheck)
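
For readers less familiar with `sed`, the range expression in the extraction step can be read roughly as the following JavaScript. This is only an illustrative sketch; `extract404Section` is a hypothetical name and is not part of the workflow.

```javascript
// Rough JavaScript equivalent of: sed -n '/URLs returning 404:/,$p' output.log
// Returns everything from the first line containing the marker to the end of the log,
// or an empty string when the marker never appears.
function extract404Section(log, marker = 'URLs returning 404:') {
  const lines = log.split('\n');
  const start = lines.findIndex((line) => line.includes(marker));
  return start === -1 ? '' : lines.slice(start).join('\n');
}

const sampleLog = ['Processing batch 1/1', 'URLs returning 404:', '- https://example.com/x.md'].join('\n');
console.log(extract404Section(sampleLog));
```

An empty result corresponds to the case where the Slack step is skipped because `errors` is empty.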

38-49: Consider adding error categorization in Slack notification.

The Slack notification could be more informative by categorizing errors.
 SLACK_MESSAGE: |
   🚨 The following URLs returned 404 during the link check:
${{ steps.extract-404.outputs.errors }}
+  Note: These errors might be due to:
+  - Missing documentation files
+  - Incorrect edit link paths
+  - Repository structure changes

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4d002ce and 689a861.

📒 Files selected for processing (5)

.github/workflows/check-edit-links.yml (1 hunks)
components/layout/DocsLayout.tsx (1 hunks)
config/edit-page-config.json (2 hunks)
package.json (1 hunks)
scripts/markdown/check-editlinks.js (1 hunks)

🧰 Additional context used

🪛 eslint

scripts/markdown/check-editlinks.js

[error] 58-58: Unexpected await inside a loop.

(no-await-in-loop)

[error] 59-59: Unexpected await inside a loop.

(no-await-in-loop)


⏰ Context from checks skipped due to timeout of 180000ms (1)

GitHub Check: Lighthouse CI

🔇 Additional comments (4)

components/layout/DocsLayout.tsx (1)

37-39: Ensure correct conversion of .mdx to .md in edit links

By replacing .mdx with .md in edit links, please verify that all .mdx files correspond to .md files in the repository. Ensure that this change does not introduce broken edit links for any files that should remain as .mdx.
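
A minimal sketch of the kind of extension swap being discussed, assuming the conversion should only affect a trailing `.mdx`. `toMarkdownPath` is a hypothetical helper, not the actual `generateEditLink` implementation.

```javascript
// Hypothetical helper: swaps a trailing .mdx for .md.
// Anchoring the pattern with $ avoids touching ".mdx" that appears mid-path.
function toMarkdownPath(filePath) {
  return filePath.replace(/\.mdx$/, '.md');
}

console.log(toMarkdownPath('docs/concepts/application.mdx')); // docs/concepts/application.md
console.log(toMarkdownPath('docs/tutorials/index.md')); // docs/tutorials/index.md
```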

config/edit-page-config.json (1)

4-4: Verify the updated href path for the AsyncAPI Generator docs

The href for /tools/generator has been updated to a new path. Please confirm that https://github.com/asyncapi/generator/tree/master/apps/generator/docs is the correct location for the generator's documentation and that the link is accessible.

.github/workflows/check-edit-links.yml (1)

3-7: LGTM! Well-configured workflow triggers.

The workflow is appropriately scheduled to run weekly and includes manual trigger capability.
package.json (1)

27-27: Verify the edit links checker script and its dependencies.

The script addition looks good, but let's ensure the required files and dependencies are in place.

✅ Verification successful

✓ Edit links checker implementation is complete and properly configured

All required dependencies and configuration files are in place:

- node-fetch-2 is available in package.json
- edit-page-config.json exists with valid mappings

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Description: Verify the existence of the edit links checker script and related files

# Check if the script file exists
if ! [ -f "scripts/markdown/check-editlinks.js" ]; then
  echo "Error: check-editlinks.js script is missing"
  exit 1
fi

# Check for required dependencies in package.json
rg "scripts/markdown/check-editlinks.js" -l
Length of output: 105

Script:
#!/bin/bash
# Check script content and dependencies
cat scripts/markdown/check-editlinks.js

# Also check package.json for any related dependencies
cat package.json | jq '.dependencies, .devDependencies'
Length of output: 9008

Script:
#!/bin/bash
# Check if the config file exists
if [ -f "config/edit-page-config.json" ]; then
    echo "Config file exists"
    cat config/edit-page-config.json
else
    echo "Config file missing"
fi
Length of output: 705

coderabbitai · 2025-01-08T15:13:50Z

scripts/markdown/check-editlinks.js

+    const batchResults = await processBatch(batch);
+    await pause(1000);


🛠️ Refactor suggestion

Consider refactoring to avoid 'await' inside a loop

Static analysis tools have flagged the use of await inside a loop on lines 58-59. Using await in a loop can lead to sequential execution and potential performance issues. Consider refactoring the code to process batches concurrently.

Apply this diff to refactor the code:

-  for (let i = 0; i < paths.length; i += batchSize) {
-    console.log(`Processing batch ${Math.floor(i / batchSize) + 1}/${Math.ceil(paths.length / batchSize)}`);
-    const batch = paths.slice(i, i + batchSize);
-    const batchResults = await processBatch(batch);
-    await pause(1000);
-
-    // Filter out null results and add valid URLs to results
-    result.push(...batchResults.filter((url) => url !== null));
-  }
+  const batches = [];
+  for (let i = 0; i < paths.length; i += batchSize) {
+    const batch = paths.slice(i, i + batchSize);
+    batches.push(batch);
+  }
+
+  console.log(`Processing ${batches.length} batches concurrently...`);
+  const batchResultsArray = await Promise.all(
+    batches.map(async (batch) => {
+      const batchResults = await processBatch(batch);
+      // Introduce a pause between batches if necessary
+      await pause(1000);
+      return batchResults.filter((url) => url !== null);
+    })
+  );
+
+  // Flatten the array of results
+  result.push(...batchResultsArray.flat());


We already use this logic in our Dashboard script and it works great. Let's keep a uniform style.

@anshgoyalevil Understood, I'll respect the existing pattern used in the Dashboard script. Thanks for the clarification!

(｀･ω･´)ゞ

✏️ Learnings added

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.620Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.


@anshgoyalevil The pattern used in the dashboard script is not appropriate. I think CodeRabbitAI has suggested a more configurable approach where you can change the size of the batch, using a variable. Try to implement this suggestion.
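
To make the trade-off in this thread concrete: in the `Promise.all` variant, every batch starts at once, so the `pause` inside each mapper no longer spaces out the requests. A sequential loop with a configurable batch size keeps the spacing. The sketch below is one possible shape under that assumption; `processInBatches` and its option names are hypothetical, and `processBatch` and `pause` are passed in rather than taken from the PR.

```javascript
// Sketch of sequential batch processing with a configurable batch size and delay.
// The names mirror the thread, but the signatures are assumptions.
async function processInBatches(items, processBatch, { batchSize = 5, delayMs = 1000, pause } = {}) {
  const results = [];
  for (let i = 0; i < items.length; i += batchSize) {
    const batch = items.slice(i, i + batchSize);
    // Awaiting inside the loop is deliberate: each batch finishes before the next one starts.
    const batchResults = await processBatch(batch);
    results.push(...batchResults.filter((item) => item !== null));
    // Pause only between batches, not after the last one.
    if (pause && i + batchSize < items.length) {
      await pause(delayMs);
    }
  }
  return results;
}

const pause = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

processInBatches([1, 2, 3, 4, 5], async (batch) => batch.map((n) => (n % 2 ? n : null)), {
  batchSize: 2,
  delayMs: 10,
  pause
}).then((odd) => console.log(odd)); // [ 1, 3, 5 ]
```

ESLint's `no-await-in-loop` would still flag this loop; when the sequencing is intentional, a targeted disable comment is the usual way to document that.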


.github/workflows/check-edit-links.yml

codecov · 2025-01-08T15:16:12Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (78c6f2d) to head (22a31ce).

Additional details and impacted files

@@            Coverage Diff            @@
##            master     #3557   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           19        20    +1     
  Lines          668       731   +63     
=========================================
+ Hits           668       731   +63


anshgoyalevil · 2025-01-08T15:21:19Z

jest.config.js

+    'scripts/compose.js',
+    'scripts/tools/categorylist.js',
+    'scripts/tools/tags-color.js',
+    'scripts/markdown/check-editlinks.js'


I have excluded the new script from Jest coverage so that CI passes. I will create a new good first issue so other contributors can add tests.

I think we should make it a practice to add tests along with the code, so don't add the file here. Instead, add the relevant tests for the file.

uhm okay. will add it

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 689a861 and c13eb87.

📒 Files selected for processing (1)

jest.config.js (1 hunks)

⏰ Context from checks skipped due to timeout of 180000ms (3)

GitHub Check: Redirect rules - asyncapi-website
GitHub Check: Header rules - asyncapi-website
GitHub Check: Pages changed - asyncapi-website

🔇 Additional comments (1)

jest.config.js (1)

14-14: LGTM!

The testMatch pattern correctly excludes Netlify edge function tests while including other test files.

jest.config.js

aeworxet · 2025-01-09T10:56:40Z

@asyncapi/bounty_team

akshatnema · 2025-01-11T13:29:28Z

scripts/markdown/check-editlinks.js

+
+    if (invalidUrls.length === 0) {
+      console.log('All URLs are valid.');
+      process.exit(0);


We shouldn't call process.exit like this. Restructure the condition instead: keep a single if block that handles the invalidUrls case only.
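
One way to follow this advice is to compute the report and the exit code as plain data and let Node exit on its own. The sketch below assumes a non-zero exit code is wanted when broken links exist, which the thread does not confirm. `summarize` is a hypothetical name, and the entry shape (`editLink`, `filePath`) follows the log line quoted later in this review.

```javascript
// Sketch: build the report and choose an exit code without calling process.exit().
function summarize(invalidUrls) {
  if (invalidUrls.length > 0) {
    const lines = invalidUrls.map((url) => `- ${url.editLink} generated from ${url.filePath}`);
    return {
      exitCode: 1, // assumption: broken links should fail the run
      message: ['URLs returning 404:', ...lines, `Total invalid URLs found: ${invalidUrls.length}`].join('\n')
    };
  }
  return { exitCode: 0, message: 'All URLs are valid.' };
}

const report = summarize([]);
console.log(report.message); // All URLs are valid.
// Setting process.exitCode lets pending output flush before Node exits by itself.
process.exitCode = report.exitCode;
```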

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (3)

tests/fixtures/markdown/check-editlinks-data.js (1)

3-5: Ensure consistency in URL path formats

The URL paths in the test data are inconsistent - some start with '/' while others don't. This inconsistency could lead to issues in path matching and URL generation.

Apply this diff to maintain consistency:
-    urlPath: 'docs/concepts/application',
+    urlPath: '/docs/concepts/application',
-    urlPath: 'concepts/application',
+    urlPath: '/concepts/application',
-    urlPath: '/tools/cli',
+    urlPath: 'tools/cli',
Also applies to: 8-10, 13-15
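
Whichever convention is chosen, one way to enforce a single format is to normalize paths before they are compared. The sketch below assumes the form without a leading slash; the `startsWith('docs/')` check quoted elsewhere in this review hints at that form, but that is an inference. `normalizeUrlPath` is a hypothetical helper.

```javascript
// Hypothetical normalizer: strips leading slashes so every urlPath uses one format.
function normalizeUrlPath(urlPath) {
  return urlPath.replace(/^\/+/, '');
}

console.log(normalizeUrlPath('/tools/cli')); // tools/cli
console.log(normalizeUrlPath('docs/concepts/application')); // docs/concepts/application
```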

tests/markdown/check-editlinks.test.js (1)

90-93: Add timeout handling test case

The network error test case only covers generic errors. Add a specific test case for request timeouts.

Add this test case:
it('should handle request timeouts', async () => {
  fetch.mockImplementation(() => new Promise((_, reject) => {
    setTimeout(() => reject(new Error('Request timeout')), 5000);
  }));
  await expect(processBatch(testBatch)).rejects.toThrow('Request timeout');
});
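
For such a test to be meaningful, the script would need some timeout behavior of its own. A minimal sketch of one generic approach, assuming a plain `Promise.race` wrapper is acceptable; `withTimeout` is a hypothetical helper and is not part of the script under review.

```javascript
// Sketch: reject if `promise` does not settle within `ms` milliseconds.
function withTimeout(promise, ms) {
  let timer;
  const timeout = new Promise((_, reject) => {
    timer = setTimeout(() => reject(new Error('Request timeout')), ms);
  });
  // Clear the timer either way so it does not keep the process alive.
  return Promise.race([promise, timeout]).finally(() => clearTimeout(timer));
}

const never = new Promise(() => {});
withTimeout(never, 20).catch((err) => console.log(err.message)); // Request timeout
```

A short delay like this also keeps a Jest test well under the default per-test timeout, which a 5000 ms delay would not.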

scripts/markdown/check-editlinks.js (1)

103-136: Implement pagination for large directories

The generatePaths function loads all files into memory at once, which could cause issues with large directories.

Consider implementing pagination to process files in chunks:

async function* generatePathsIterator(folderPath, editOptions, batchSize = 100) {
  const stack = [{ path: folderPath, relativePath: '' }];
  let batch = [];

  while (stack.length > 0) {
    const { path: currentPath, relativePath } = stack.pop();
    const files = await fs.readdir(currentPath);

    for (const file of files) {
      const filePath = path.join(currentPath, file);
      const relativeFilePath = path.join(relativePath, file);
      const stats = await fs.stat(filePath);

      if (stats.isDirectory()) {
        stack.push({ path: filePath, relativePath: relativeFilePath });
      } else if (stats.isFile() && file.endsWith('.md') && file !== '_section.md') {
        const urlPath = relativeFilePath.split(path.sep).join('/').replace('.md', '');
        batch.push({
          filePath,
          urlPath,
          editLink: determineEditLink(urlPath, filePath, editOptions)
        });

        if (batch.length >= batchSize) {
          yield batch;
          batch = [];
        }
      }
    }
  }

  if (batch.length > 0) {
    yield batch;
  }
}
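
An async generator like the one suggested above would be consumed with `for await...of`. The sketch below uses a self-contained stand-in that batches an in-memory array instead of walking the file system; `inBatches` and `collectBatches` are hypothetical names.

```javascript
// Stand-in for the suggested iterator: yields items in fixed-size batches.
async function* inBatches(items, batchSize) {
  let batch = [];
  for (const item of items) {
    batch.push(item);
    if (batch.length >= batchSize) {
      yield batch;
      batch = [];
    }
  }
  if (batch.length > 0) {
    yield batch;
  }
}

async function collectBatches(items, batchSize) {
  const seen = [];
  // for await...of pulls one batch at a time; a real consumer would check each batch and then drop it.
  for await (const batch of inBatches(items, batchSize)) {
    seen.push(batch);
  }
  return seen;
}

collectBatches(['a.md', 'b.md', 'c.md'], 2).then((batches) => console.log(batches));
// [ [ 'a.md', 'b.md' ], [ 'c.md' ] ]
```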

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c13eb87 and 23a3f6d.

📒 Files selected for processing (6)

jest.config.js (1 hunks)
package.json (1 hunks)
scripts/markdown/check-editlinks.js (1 hunks)
tests/fixtures/markdown/check-editlinks-data.js (1 hunks)
tests/fixtures/markdown/edit-page-config.json (1 hunks)
tests/markdown/check-editlinks.test.js (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (2)

package.json
jest.config.js

🧰 Additional context used

📓 Learnings (1)

scripts/markdown/check-editlinks.js (1)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

🔇 Additional comments (2)

tests/fixtures/markdown/edit-page-config.json (1)

15-17: Verify empty value fallback handling

The empty value entry is used as a fallback, but this could lead to incorrect URL generation if not handled carefully. Ensure that the determineEditLink function properly validates the target before using it.

✅ Verification successful

Empty value fallback is properly implemented and tested

The empty value entry in the configuration is intentionally used as a fallback, with proper handling in the determineEditLink function. The implementation adds necessary path components and has comprehensive test coverage verifying this behavior.
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Check if determineEditLink handles null/undefined targets
ast-grep --pattern 'function determineEditLink($_, $_, $_) {
  $$$
  if (!target) {
    $$$
  }
  $$$
}'
Length of output: 106

Script:
#!/bin/bash
# Search for determineEditLink function with broader context
rg -l "determineEditLink"

# Search for files containing both "edit" and "link"
rg -l "edit.*link|link.*edit" --type js --type ts

# Look for test files related to edit links
fd "edit.*test|test.*edit" -e js -e ts -e jsx -e tsx
Length of output: 402

Script:
#!/bin/bash
# Check implementation
cat scripts/markdown/check-editlinks.js

# Check test cases
cat tests/markdown/check-editlinks.test.js

# Check test data
cat tests/fixtures/markdown/check-editlinks-data.js
Length of output: 13950
scripts/markdown/check-editlinks.js (1)

147-153: 🛠️ Refactor suggestion

Improve error handling in main function

The main function's error handling could be more informative and should set appropriate exit codes.

Apply this diff to improve error handling:
   if (invalidUrls.length > 0) {
     console.log('\nURLs returning 404:\n');
     invalidUrls.forEach((url) => console.log(`- ${url.editLink} generated from ${url.filePath}\n`));
     console.log(`\nTotal invalid URLs found: ${invalidUrls.length}`);
+    process.exitCode = 1;
   } else {
     console.log('All URLs are valid.');
+    process.exitCode = 0;
   }
Likely invalid or redundant comment.

scripts/markdown/check-editlinks.js

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (4)

scripts/markdown/check-editlinks.js (4)
3-3: Consider using native fetch or updating to latest node-fetch

The code uses node-fetch-2 which is an older version. Consider using the native fetch API (available since Node.js 18) or updating to the latest version of node-fetch.
-const fetch = require('node-fetch-2');
+const fetch = (...args) => import('node-fetch').then(({default: fetch}) => fetch(...args));
6-10: Move ignore list to configuration file

Consider moving the ignoreFiles array to the configuration file (edit-page-config.json) to make it more maintainable and configurable without code changes.

142-142: Enhance error message detail

The error message could be more informative by including the original error details.
-    throw new Error(`Error processing directory ${folderPath}:`, err);
+    throw new Error(`Error processing directory ${folderPath}: ${err.message}`);
163-163: Enhance error message detail

Similar to the previous suggestion, include the original error message for better debugging.
-    throw new Error('Failed to check edit links:', error);
+    throw new Error(`Failed to check edit links: ${error.message}`);
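
The reason the original lines lose detail is that the second argument to `Error` is an options object, not extra message text. The sketch below shows the difference; attaching the original error through `cause` is an optional addition beyond what the review suggests and assumes Node.js 16.9 or later.

```javascript
// `new Error('msg:', err)` does not append err to the message.
const original = new Error('ENOENT: no such file');

const lossy = new Error('Error processing directory docs:', original);
const detailed = new Error(`Error processing directory docs: ${original.message}`, { cause: original });

console.log(lossy.message); // Error processing directory docs:
console.log(detailed.message); // Error processing directory docs: ENOENT: no such file
console.log(detailed.cause === original); // true
```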

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 23a3f6d and c101b66.

📒 Files selected for processing (1)

scripts/markdown/check-editlinks.js (1 hunks)

🧰 Additional context used

📓 Learnings (1)

scripts/markdown/check-editlinks.js (2)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:80-93
Timestamp: 2025-01-14T09:23:32.728Z
Learning: In the AsyncAPI website's edit link generation system, the `editOptions` array in `edit-page-config.json` includes a fallback entry with an empty string value (`''`) that matches any URL path, ensuring that `determineEditLink()` function always finds a target and never returns null.

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

🔇 Additional comments (4)

scripts/markdown/check-editlinks.js (4)

12-20: LGTM! Well-documented helper function

The pause function is well-implemented with clear JSDoc documentation.

58-79: Align batch processing pattern with codebase standards

The current implementation uses concurrent processing with Promise.all, but based on the codebase patterns (as seen in the Dashboard script), sequential processing is preferred.

Would you like to maintain consistency with the codebase's sequential pattern or keep the current concurrent implementation? Please confirm your preference.

88-101: LGTM! Robust edit link determination

The implementation correctly handles the fallback case and special path prefixes. The code aligns with the established pattern where an empty string value in editOptions serves as a catch-all fallback.
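
The catch-all works because `String.prototype.includes('')` is always true, so an entry with an empty `value` matches any path as long as it comes after the specific entries. The sketch below reproduces the `determineEditLink` logic quoted in this review against a sample config; the `value` and `href` entries are placeholders, not the real contents of `edit-page-config.json`.

```javascript
const path = require('path');

// Placeholder config: one specific entry followed by the empty-string fallback.
const editOptions = [
  { value: 'tools/cli', href: 'https://example.com/cli/docs' },
  { value: '', href: 'https://example.com/website' }
];

function determineEditLink(urlPath, filePath, options) {
  const pathForMatching = urlPath.startsWith('docs/') ? urlPath.slice(5) : urlPath;
  // find() returns the first match, so the fallback must be listed last.
  const target = options.find((edit) => pathForMatching.includes(edit.value));
  if (target.value === '') {
    return `${target.href}/docs/${urlPath}.md`;
  }
  return `${target.href}/${path.basename(filePath)}`;
}

console.log(determineEditLink('docs/tools/cli/usage', 'markdown/docs/tools/cli/usage.md', editOptions));
// https://example.com/cli/docs/usage.md
console.log(determineEditLink('concepts/application', 'markdown/docs/concepts/application.md', editOptions));
// https://example.com/website/docs/concepts/application.md
```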

172-172: LGTM! Clean exports

The module exports are well-organized and include all necessary functions.


akshatnema · 2025-01-14T14:34:56Z

scripts/markdown/check-editlinks.js

@@ -0,0 +1,172 @@
+const fs = require('fs').promises;
+const path = require('path');
+const fetch = require('node-fetch-2');


Why node-fetch-2 instead of normal fetch API?

It was the module problem we had. I don't remember exactly what it was, but we weren't able to use node-fetch because of a version issue.

Here's the PR:
#3038

.github/workflows/check-edit-links.yml

akshatnema · 2025-01-14T14:35:59Z

.github/workflows/check-edit-links.yml

+            ```
+            ${{ steps.extract-404.outputs.errors }}
+            ```
+          MSG_MINIMAL: true


What does this parameter do for Slack?

It removes a bunch of useless information from the message.

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (3)

.github/workflows/check-edit-links.yml (3)
4-6: Add detailed cron syntax documentation.

While the cron syntax is correct, consider adding a more detailed comment explaining each field for better maintainability:
   schedule:
-    - cron: '0 0 * * 0' # Runs every week at midnight on Sunday
+    - cron: '0 0 * * 0' # minute(0) hour(0) day(*) month(*) weekday(0=Sunday) - Runs every Sunday at midnight UTC
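
As a reading aid for the five cron fields, the following sketch labels each position of the expression. `describeCron` is a hypothetical helper and only splits the string; it does not validate it.

```javascript
// Sketch: label the five fields of a standard cron expression.
function describeCron(expression) {
  const [minute, hour, dayOfMonth, month, dayOfWeek] = expression.trim().split(/\s+/);
  return { minute, hour, dayOfMonth, month, dayOfWeek };
}

const fields = describeCron('0 0 * * 0');
console.log(fields.minute, fields.hour); // 0 0
console.log(fields.dayOfWeek); // 0 (Sunday)
```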
38-40: Consider moving the if condition.

As suggested in the past review, consider moving the if condition above the step name for better readability and to follow the standard pattern in GitHub Actions:
-      - name: Notify Slack
-        if: ${{ steps.extract-404.outputs.errors != '' }}
+      - if: ${{ steps.extract-404.outputs.errors != '' }}
+        name: Notify Slack
49-49: Document the MSG_MINIMAL parameter.

The MSG_MINIMAL parameter's purpose isn't immediately clear. Add a comment explaining its effect on the Slack notification:
-          MSG_MINIMAL: true
+          MSG_MINIMAL: true # Sends a compact message without additional metadata like GitHub workflow details

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c101b66 and b035fc4.

📒 Files selected for processing (5)

.github/workflows/check-edit-links.yml (1 hunks)
package.json (1 hunks)
scripts/dashboard/build-dashboard.js (1 hunks)
scripts/markdown/check-edit-links.js (1 hunks)
tests/markdown/check-edit-links.test.js (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

package.json

🧰 Additional context used

📓 Learnings (2)

scripts/markdown/check-edit-links.js (1)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

.github/workflows/check-edit-links.yml (1)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: .github/workflows/check-edit-links.yml:25-29
Timestamp: 2025-01-08T15:16:27.655Z
Learning: In GitHub workflows running scripts with process.exit statements for error handling (like check-editlinks.js in asyncapi/website), avoid adding error suppression (|| true) at the workflow level as it would mask the intended error reporting mechanism.

🪛 eslint

scripts/dashboard/build-dashboard.js

[error] 184-184: Replace ·getLabel,·monthsSince,·mapGoodFirstIssues,·getHotDiscussions,·getDiscussionByID,·getDiscussions,·writeToFile,·start,·processHotDiscussions,·pause· with ⏎··getLabel,⏎··monthsSince,⏎··mapGoodFirstIssues,⏎··getHotDiscussions,⏎··getDiscussionByID,⏎··getDiscussions,⏎··writeToFile,⏎··start,⏎··processHotDiscussions,⏎··pause⏎

(prettier/prettier)


⏰ Context from checks skipped due to timeout of 180000ms (1)

GitHub Check: Test NodeJS PR - macos-13

🔇 Additional comments (6)

scripts/markdown/check-edit-links.js (2)

83-93: ⚠️ Potential issue

Handle undefined 'target' to prevent runtime errors in 'determineEditLink'.

In the determineEditLink function, if no matching edit option is found, target could be undefined, leading to a TypeError when accessing target.value. Please add a check to handle the case when target is undefined to prevent runtime errors.

Apply this diff to fix the issue:
 function determineEditLink(urlPath, filePath, editOptions) {
   // Remove leading 'docs/' if present for matching
   const pathForMatching = urlPath.startsWith('docs/') ? urlPath.slice(5) : urlPath;

   const target = editOptions.find((edit) => pathForMatching.includes(edit.value));

+  if (!target) {
+    // Handle the case where no matching edit option is found
+    return null; // Or handle appropriately based on your application's needs
+  }

   // Handle the empty value case (fallback)
   if (target.value === '') {
     return `${target.href}/docs/${urlPath}.md`;
   }

   // For other cases with specific targets
   return `${target.href}/${path.basename(filePath)}`;
 }
Ensure that downstream code can handle a null return value from this function, or adjust the handling accordingly.

3-3: ⚠️ Potential issue

Correct the import statement for 'node-fetch'.

It appears that 'node-fetch-2' is not a valid package. To import version 2 of node-fetch, you should use 'node-fetch'. Please update the import statement accordingly.

Apply this diff to fix the import statement:
-const fetch = require('node-fetch-2');
+const fetch = require('node-fetch');
Ensure that the node-fetch package is installed in your package.json dependencies.
scripts/dashboard/build-dashboard.js (1)

184-184: Reformat 'module.exports' for better readability as per Prettier guidelines.

The static analysis tool indicates that the module.exports statement should be reformatted for consistency and readability.

Apply this diff to format module.exports:
-module.exports = { getLabel, monthsSince, mapGoodFirstIssues, getHotDiscussions, getDiscussionByID, getDiscussions, writeToFile, start, processHotDiscussions, pause };
+module.exports = {
+  getLabel,
+  monthsSince,
+  mapGoodFirstIssues,
+  getHotDiscussions,
+  getDiscussionByID,
+  getDiscussions,
+  writeToFile,
+  start,
+  processHotDiscussions,
+  pause
+};
Ensure that the updated formatting aligns with the project's coding standards and passes the linter checks.

tests/markdown/check-edit-links.test.js (1)

2-2: ⚠️ Potential issue

Correct import and mocking of 'node-fetch'.

It appears that 'node-fetch-2' is not a valid package name. Replace 'node-fetch-2' with 'node-fetch' in both the import statement and the Jest mock.

Apply these diffs:

At line 2:
-const fetch = require('node-fetch-2');
+const fetch = require('node-fetch');
At line 13:
-jest.mock('node-fetch-2', () => jest.fn());
+jest.mock('node-fetch', () => jest.fn());
Ensure that the node-fetch package is correctly installed and updated in your package.json dependencies.

Also applies to: 13-13
.github/workflows/check-edit-links.yml (2)

9-11: LGTM! Well-structured job configuration.

The job configuration follows best practices with clear naming and appropriate runner selection.

25-29: Approved: Proper error handling implementation.

The link checker step correctly allows the script's built-in error handling to work without suppression, as noted in the previous learning. This ensures proper error reporting and Slack notifications.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (3)

scripts/markdown/check-edit-links.js (3)
3-3: Consider using native fetch or updating to latest node-fetch.

The script uses node-fetch-2 which is an older version. Consider:

Using the native fetch API (available since Node.js 18.0.0)

Updating to the latest node-fetch version
-const fetch = require('node-fetch-2');
+// node-fetch v3 is ESM-only, so load it lazily from CommonJS
+// (on Node.js >= 18.0.0 the global fetch needs no import at all)
+const fetch = (...args) => import('node-fetch').then(({default: fetch}) => fetch(...args));
19-19: Extract timeout duration as a named constant.

The timeout duration is a magic number. Consider extracting it to a named constant at the module level for better maintainability.
+const URL_CHECK_TIMEOUT_MS = 5000;
+
 async function processBatch(batch) {
-  const TIMEOUT_MS = 5000;
+  const TIMEOUT_MS = URL_CHECK_TIMEOUT_MS;
51-51: Extract batch size as a named constant.

The batch size is a magic number. Consider extracting it to a named constant at the module level for better configuration.
+const URL_CHECK_BATCH_SIZE = 5;
+
 async function checkUrls(paths) {
   const result = [];
-  const batchSize = 5;
+  const batchSize = URL_CHECK_BATCH_SIZE;

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b035fc4 and afa5a33.

📒 Files selected for processing (1)

scripts/markdown/check-edit-links.js (1 hunks)

🧰 Additional context used

📓 Learnings (1)

scripts/markdown/check-edit-links.js (1)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

🔇 Additional comments (2)

scripts/markdown/check-edit-links.js (2)

102-135: Well-implemented recursive file processing.

The function is well-structured with:

Proper error handling

Concurrent file processing

Clear documentation

Edge case handling (e.g., _section.md files)

137-163: Clean main function implementation with proper exports.

The implementation is solid with:

Comprehensive error handling

Clear console output

Well-organized exports

Proper module execution check

scripts/markdown/check-edit-links.js

akshatnema · 2025-01-14T16:00:04Z

scripts/markdown/check-edit-links.js

@@ -0,0 +1,163 @@
+const fs = require('fs').promises;
+const path = require('path');
+const fetch = require('node-fetch-2');


Why is node-fetch-2 used here?

#3557 (comment)

https://github.com/asyncapi/website/pull/3038/files#r1655221901

We use it the same way in the other JS scripts, for example:

website/scripts/build-newsroom-videos.js

Line 3 in 19c6b07

const fetch = require('node-fetch-2');

akshatnema · 2025-01-14T16:04:22Z

scripts/markdown/check-edit-links.js

+ * @returns {Promise<string[]>} Array of URLs that returned 404
+ */
+async function processBatch(batch) {
+  const TIMEOUT_MS = 5000;


Make it configurable with an environment variable as well.

akshatnema · 2025-01-14T16:05:18Z

scripts/markdown/check-edit-links.js

+ */
+async function checkUrls(paths) {
+  const result = [];
+  const batchSize = 5;


Kindly make it configurable using an environment variable.
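
One way to satisfy both requests (timeout and batch size) is a small parsing helper with a fallback. This is only a sketch: `readPositiveInt` and the variable names `DOCS_LINK_CHECK_TIMEOUT` and `DOCS_LINK_CHECK_BATCH_SIZE` are assumptions made for illustration, not names confirmed by this thread.

```javascript
// Hypothetical helper: parse a positive integer from an environment map,
// falling back to a default when the variable is missing or invalid.
function readPositiveInt(name, fallback, env = process.env) {
  const parsed = Number.parseInt(env[name] ?? '', 10);
  return Number.isInteger(parsed) && parsed > 0 ? parsed : fallback;
}

// The variable names are assumptions; the defaults mirror the values in the diff.
const TIMEOUT_MS = readPositiveInt('DOCS_LINK_CHECK_TIMEOUT', 5000);
const BATCH_SIZE = readPositiveInt('DOCS_LINK_CHECK_BATCH_SIZE', 5);
```

Falling back on invalid input keeps a typo in the workflow configuration from turning into a zero timeout or an endless loop over empty batches.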

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (3)

scripts/markdown/check-edit-links.js (3)

7-11: Consider moving ignore list to configuration file.

The ignoreFiles list could be moved to the configuration file (config/edit-page-config.json) to make it more maintainable and configurable without code changes.

102-135: Consider adding additional file validation.

While the implementation is solid, consider adding validation for:

Maximum file size to prevent processing extremely large files

File name patterns to exclude temporary or backup files (e.g., .md~, .md.bak)
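
A name-based filter along these lines might look like the sketch below. `isCheckableMarkdown` is a hypothetical helper, and accepting only the exact `.md` extension is an assumption about which files the script should visit.

```javascript
const path = require('path');

// Hypothetical filter: accept only files whose final extension is exactly ".md".
// Editor backups such as "guide.md~" or "guide.md.bak" have a different
// final extension and are therefore rejected.
function isCheckableMarkdown(fileName) {
  return path.extname(fileName) === '.md';
}
```

For instance, `isCheckableMarkdown('docs/guide.md')` is true, while `guide.md~` and `guide.md.bak` are both rejected.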

137-163: Consider enhancing error reporting.

While the error handling is good, consider adding:

Total number of URLs checked in the success message

Structured error output (e.g., JSON format) for better parsing

Error categorization (timeout vs 404 vs other errors)
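
The three suggestions could be combined roughly as in the sketch below. The names `categorize` and `summarize`, the outcome shape `{ status, error }`, and the category labels are all hypothetical. The sketch also assumes an aborted request rejects with an error whose `name` is `AbortError`.

```javascript
// Hypothetical classifier for one link-check outcome.
function categorize({ status, error }) {
  if (error && error.name === 'AbortError') return 'timeout';
  if (error) return 'network-error';
  if (status === 404) return 'not-found';
  if (status >= 400) return 'http-error';
  return 'ok';
}

// Build a structured summary that can be serialized with JSON.stringify.
function summarize(outcomes) {
  const byCategory = {};
  for (const outcome of outcomes) {
    const category = categorize(outcome);
    byCategory[category] = (byCategory[category] || 0) + 1;
  }
  return { checked: outcomes.length, byCategory };
}
```

A structured summary like this is easier to forward to a notification step than free-form console output, because the consumer can branch on the category counts.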

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between afa5a33 and 22a31ce.

📒 Files selected for processing (1)

scripts/markdown/check-edit-links.js (1 hunks)

🧰 Additional context used

📓 Learnings (1)

scripts/markdown/check-edit-links.js (1)

Learnt from: anshgoyalevil
PR: asyncapi/website#3557
File: scripts/markdown/check-editlinks.js:58-59
Timestamp: 2025-01-08T15:15:00.759Z
Learning: In the AsyncAPI codebase, batch processing operations (like in the Dashboard script and check-editlinks.js) follow a sequential pattern using await in loops, which is the preferred approach for maintaining consistency across the codebase.

🔇 Additional comments (3)

scripts/markdown/check-edit-links.js (3)

18-42: Well-implemented batch processing with timeout handling!

The implementation includes:

Configurable timeout via environment variable

Proper request cancellation using AbortController

Comprehensive error handling

Test coverage exclusions where appropriate
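
The timeout-and-cancellation pattern described above can be sketched as follows. This is a minimal illustration, not the PR's code: `checkUrl` is a hypothetical name, and `fetchImpl` is injected so the sketch does not depend on a particular fetch package.

```javascript
// Sketch: HEAD-check one URL, aborting if it takes longer than timeoutMs.
// Resolves to the URL when the server answers 404, otherwise to null.
// A timeout or network failure rejects, leaving error handling to the caller.
async function checkUrl(url, timeoutMs, fetchImpl) {
  const controller = new AbortController();
  const timer = setTimeout(() => controller.abort(), timeoutMs);
  try {
    const response = await fetchImpl(url, { method: 'HEAD', signal: controller.signal });
    return response.status === 404 ? url : null;
  } finally {
    // Always clear the timer so it cannot keep the process alive.
    clearTimeout(timer);
  }
}
```

Clearing the timer in `finally` matters for a script that ends by exiting the process: a pending timer would otherwise delay a clean shutdown after the last request settles.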

49-70: Efficient implementation of concurrent batch processing!

The implementation includes:

Configurable batch size via environment variable

Concurrent processing with Promise.all

Rate limiting prevention with pauses between batches
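
Put together, the pattern is: requests inside a batch run concurrently, while the batches themselves run one after another with a pause in between. The sketch below illustrates that shape under stated assumptions. `checkInBatches` and the `checkOne` callback are hypothetical names, and `pause` is redefined locally so the example is self-contained.

```javascript
const pause = (ms) => new Promise((resolve) => setTimeout(resolve, ms));

// `checkOne` resolves to the URL when it is broken, or to null when it is fine.
async function checkInBatches(urls, batchSize, pauseMs, checkOne) {
  const broken = [];
  for (let i = 0; i < urls.length; i += batchSize) {
    const batch = urls.slice(i, i + batchSize);
    // Requests inside one batch run concurrently...
    const results = await Promise.all(batch.map((url) => checkOne(url)));
    broken.push(...results.filter((url) => url !== null));
    // ...but the next batch starts only after this one finishes and the pause elapses.
    await pause(pauseMs);
  }
  return broken;
}
```

Awaiting inside the loop is deliberate here: it caps the number of in-flight requests at `batchSize`, which is what makes the pause effective against rate limiting.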

79-92: Clean and efficient edit link determination logic!

The implementation correctly handles both specific targets and fallback cases, with proper path manipulation.

anshgoyalevil added 6 commits January 8, 2025 13:26

add script

dd0d3ee

add workflow

05a2fe4

fix mdx

6cf369a

fix wf

9675b46

fix wf

c3c8266

fix wf

689a861

anshgoyalevil requested review from derberg, akshatnema, magicmatatjahu, Mayaleeeee and asyncapi-bot-eve as code owners January 8, 2025 15:09

asyncapi-bot requested review from devilkiller-ag and sambhavgupta0705 January 8, 2025 15:09

coderabbitai bot reviewed Jan 8, 2025

fix cov

c13eb87

anshgoyalevil commented Jan 8, 2025

coderabbitai bot reviewed Jan 8, 2025

jest.config.js (outdated, resolved)

asyncapi-bot added the bounty (AsyncAPI Bounty program related) label Jan 9, 2025

akshatnema reviewed Jan 11, 2025

anshgoyalevil added 3 commits January 14, 2025 06:15

fix comments

a79a06a

add tests

ceae634

Merge branch 'master' into fix_edit_thing

23a3f6d

coderabbitai bot reviewed Jan 14, 2025

scripts/markdown/check-editlinks.js (outdated, resolved)

scripts/markdown/check-editlinks.js (outdated, resolved)

add timeout thing

c101b66

coderabbitai bot reviewed Jan 14, 2025

akshatnema requested changes Jan 14, 2025

fix comments

b035fc4

coderabbitai bot reviewed Jan 14, 2025

scripts/markdown/check-edit-links.js (outdated, resolved)

scripts/markdown/check-edit-links.js (resolved)

tests/markdown/check-edit-links.test.js (resolved)

.github/workflows/check-edit-links.yml (resolved)

fix comments

afa5a33

coderabbitai bot reviewed Jan 14, 2025

scripts/markdown/check-edit-links.js (resolved)

akshatnema reviewed Jan 14, 2025

akshatnema requested changes Jan 14, 2025

add env var support

22a31ce

coderabbitai bot reviewed Jan 14, 2025

-    const batchResults = await processBatch(batch);
-    await pause(1000);
+  const batches = [];
+  for (let i = 0; i < paths.length; i += batchSize) {
+    const batch = paths.slice(i, i + batchSize);
+    batches.push(batch);
+  }
+  console.log(`Processing ${batches.length} batches concurrently...`);
+  const batchResultsArray = await Promise.all(
+    batches.map(async (batch) => {
+      const batchResults = await processBatch(batch);
+      // Introduce a pause between batches if necessary
+      await pause(1000);
+      return batchResults.filter((url) => url !== null);
+    })
+  );
+  // Flatten the array of results
+  result.push(...batchResultsArray.flat());
